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Mathematical Models of Computation 
Using Magnetic Bubble Interactions 


By A. D. FRIEDMAN and P. R. MENON 
(Manuscript received February 10, 1971) 


This paper considers the computational capabilities of different 
mathematical models of magnetic bubble interactions. A specific model 
was studied earlier by R. L. Graham, who showed that there exist 
combinational functions of 11 or more variables that cannot be com- 
puted by this model. This paper extends his results by introducing 
different types of interactions which seem to be practical and enable 
the computation of all combinational functions. The problem of 
efficent computation from the points of view of time and space re- 
quirements and the geometrical requirements imposed by the fact that 
interactions can occur only between physically adjacent locations are 
also examined. Finally, a model in which computations are carried 
out by applying uniform magnetic fields to the entire platelet, with 
individual access limited to locations along the periphery, is presented. 


I. INTRODUCTION 


Cylindrical magnetic domains in certain orthoferrite materials have 
been investigated extensively in recent years.-* These domains, com- 
monly referred to as bubbles, have the property that they can be 
moved within the material by the application of suitable external 
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magnetic fields. The motion of a bubble is also dependent on other 
bubbles in its vicinity. The locations of bubbles can be restricted to 
a finite set of possible positions in an orthoferrite platelet. The pres- 
ence or absence of a bubble at a particular location may be treated as 
representing the values of a binary variable. Thus, magnetic bubbles 
seem to have natural applications in performing memory and logic 
functions. | 

In a recent paper, R. L. Graham‘ considered the computational 
capabilities of a particular mathematical model of magnetic bubble 
interaction. The only type of interaction allowed in this model is 
represented by an instruction of the type (7, y) where x and y are 
distinct adjacent locations in the orthoferrite platelet. Application of 
this instruction results in moving the bubble in location x to location y 
if the latter does not contain a bubble prior to the application of the 
instruction. If x does not contain any bubble or y already has a bubble, 
no transfer of bubbles takes place. Two locations, only one of which 
contains a bubble, are used to represent each binary variable and its 
complement. Graham has shown that only a small fraction of all com- 
binational functions of 11 or more variables can be computed by this 
model. 

In this paper, we review Graham’s results and extend his model 
by introducing different types of interactions which seem to be prac- 
tical and enable the computation of all combinational functions. We 
then examine the problem of efficient computation from the points of 
view of space and time requirements. The geometrical requirements, 
necessitated by the fact that bubble interactions can occur only be- 
tween physically neighboring locations, are also examined. Finally, 
we consider a model in which computations are carried out by apply- 
ing magnetic fields to the entire platelet and individual access is lim- 
ited to locations along the periphery. . 


II: MODELS OF BUBBLE INTERACTIONS 


Let us first consider the model studied. by Graham.* All locations are 
assumed to be adjacent to one another so that interaction between any 
pair of locations is possible. If S is the set of all possible bubble loca- 
tions and n is the number of such locations, let Q be the set of all 2” 
subsets of S. If a subset X ¢ Q containing bubbles represents the values 
of the variables of a function f, then f is a mapping from Q onto Q. 
The function f is realized by applying a program P to S with bubbles 
in locations contained in X. If the set of resultant bubble locations is 
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X”, then X” = f(X) and X’ e Q. The program P consists of a sequence 
of instructions of the form e = (a, b) such that X° = (X — {a}) U {b}, 
if ae X, b ¢ X, and X° = X otherwise. The nondecreasing overlap 
(NDO) theorem due to Shockley and proven in Ref. 4 points out some 
of the limitations of this model of interaction. The NDO theorem is 
repeated below in the interest of completeness. 
Theorem 1 (NDO Theorem): Let X, and X, be two initial sets of locations 
of bubbles and let P be any program (of the type discussed above). Then 
| X40 X45] = | Xi O X.|, where | X | ts the cardinality of the set X. 
The NDO theorem precludes the possibility of certain types of 
computation using this model of bubble interaction. For example, it is 
impossible to have a program P such that {a, b}” = {c, d} and {a}? = 
{e}. Another consequence of the NDO theorem is the nonexistence of a 
replicating program. That is, there exists no program P* such that if 
S and S’ are nonintersecting sets of vertices and X C S, P* creates a 
set X’ C S’ without disturbing X, so that there is a one-to-one corre- 
spondence between X and X’. In order to be able to compute all com- 
binational functions, it may be necessary to do one or both of the 
following: (t) Include other types of interactions. (12) Code the inputs 
and the outputs. 


Let us consider the following set of instruction types, where the 
first type is the one we have discussed above. 


Name Notation Resultant Bubble Locations 
I. Bubble e=(a, b) ee {a}) U {b} ifae X, be X; 
transfer X otherwise. 
II. Bubble e=(a, b)s a {b} if ae X; 
splitting X otherwise. 
III. Bubble e=(0,a) X°=X — {a}. 
annihilation 
IV. Bubble e=(lra) X°=X VU {a}. 
creation 


In the above set, type II instructions are necessary for replication. 
Type III instructions have the property that |X¢| < |X| and are there- 
fore necessary to compute {a, b}” = {c}. Instructions of type IV are 
necessary for performing computations of the type y? = {a}, where ¢ 
is the null set. 

Theorem 2: The four types of instructions (viz., bubble transfer, 
splitting, annihilation, and creation) are not sufficient for performing 
all computations. 
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Proof: Consider a program P such that X” = S — X. Let the first 
instruction of the program be e; . We show that a program P for comput- 
ing S — X using only the four types of instructions discussed above 
does not exist, by showing that for each type of instruction there exist 
two sets X, # X. such that X{° = Xj. Since S — X, ¥ S — X, 
the program cannot compute S — X for all X. 

Case 1: Let e, = (a, b). 


Let X, = (X, = {a}) U {b}; X,, Xe C S;aeX,;b¢X,. Then 
f= Xs =X, and Xi = Xt. 


Case 2: Let e, = (a, b)s. 
Let ae X,,b¢ X,, and X, = X, U {b}. Then 


Xa ke SX and’ Xx Se 


Case 8: Let e, = (0, a). 
Let ae X,, and X, = X, — {a}. Then 


Xf? =Xp =X, and Xi 


Case 4: Let e, = (1, a). 
Let ae X,, and X, = X, — {a}. Then 


PS XS and, a SX 


Thus there exists no program P using only the four types of instruc- 
tions that can compute S:— X for all X C S. This implies that comple- 
mentation cannot be done (using only these types of instructions) 
if the value of a Boolean variable is represented by the presence or 
absence of a bubble in a particular location. 

The other approach mentioned earlier for improving the computa- 
tional capabilities of magnetic bubbles is the use of suitable codes. 
Graham‘ has used two bubble locations x) and 2x, to represent the 
value of a binary variable x. x = 0 is represented by % = 1,2, = 0 
and x = 1 by 2% = 0, x, = 1. However, Graham has shown that this 
representation and the bubble transfer instruction are not sufficient 
for realizing all Boolean functions. By enumerating the number of 
distinct programs, he has proven the following theorem. 

Theorem 3: There exists a Boolean function of 11 variables which 
cannot be realized by a program of bubble transfer instructions. 

It is not known whether there exists some coding of the variables 
which permits the realization of all Boolean functions using only the 
bubble transfer instruction. However, the four types of instructions 
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together with a coding of the type used by Graham are sufficient for 
realizing all Boolean functions. 

Theorem 4: All combinational functions can be computed using the 
four types of instructions and a suitable coding of inputs. 


Proof: Any combinational function can be realized using the following 
method. Let the set of all possible bubble locations be denoted by /, 
where M = SU T and S()\ T = ¢. As before, values of the input vari- 
ables are represented by bubbles in subsets of S, and let each variable 
be represented by two locations x) and 2, . Let T be the set of possible 
bubble locations that serves as temporary storage. Any given combina- 
tional function f and its complement f are expressed in the sum of prod- 
ucts form. The following procedure is used to realize the function f: 


(t) Clear the output locations fy and f; by (0, fo) (0, f1). 

(iz) Clear temporary storage by (0, yio)(0, yi1) for all yoo, Yau e T. 

(iz) Copy the values of the input variables into temporary storage 
using bubble splitting instructions (20, Yio)s(@i, Yis)s for 
all rio, ti @ S. 

(iv) If x* is a variable appearing in a product term, where x* rep- 
resents 2; or €;, the term [[ z* is computed by the set of 
instructions (Yia, Yj) for all 7 ¥ 7 such that x* is contained 
in the product term and a = Oif ct = €,,a = lif vt = z,, 
b= Oif ct = ¢, andb = lif «* = g;. 

(v) (Yia » f1) puts the or of all terms computed so far in f, . 

(vc) If there are additional terms, go to step zi and repeat for next 
term. 

(vit) If there are no more terms in f, repeat steps 77 through v7 for 
f, replacing step » by (yia, fo). 


The above procedure merely shows that all combinational functions 
can be computed using the four types of instructions and the par- 
ticular type of coding used. Both the program and the coding could 
be made more efficient. 

The property of the code that made the computation of all com- 
binational functions possible is the elimination of the need for com- 
plementation, which was shown to be impossible to perform with the 
four types of instructions used. A more general code which has this 
property is an m-out-of-n (m/n) code®> where n bubble locations, out 
of which exactly m contain bubbles, are used to represent each input 
combination. A procedure analogous to that given above may be used 
for computing any combinational function using this code. The case 
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discussed above, where each variable is represented by two bubble 
locations, is a special case of the m/n code and is referred to as the 
autosynchronous code.® 

The m/n code is more efficient than the autosynchronous code in 
terms of the number of bubble locations required for encoding a given 
number of binary variables. Whereas 2k locations are necessary to 
represent k binary variables, the number of locations required with an 
m/n code is such that (%) = 2". Since m = [n/2], where [z] is the integral 
part of z, gives the largest value of (%), we have (,,72;) 2 2". Using 
Stirling’s approximation, we have [n — 4 log. n] 2 k. For large values 
of k, the [n/2]/n code requires fewer locations than the autosynchronous 
code as the following sample values indicate: 


k n([n/2]/n code) 2k (autosynchronous code) 


14 16 28 
29 32 58 
61 64 122 


A given type of instruction is said to be computationally complete 
if all combinational functions can be computed using programs con- 
taining only instructions of the given type, assuming that any loca- 
tion may be initially cleared and a bubble may be initially inserted in 
any location. That is, instructions of the type (0, 7) and (1, 7) may be 
used for initialization, prior to the application of the program. Note 
that the bubble transfer instruction discussed earlier is not computa- 
tionally complete under our definition, although bubble transfer and 
bubble splitting together are sufficient to compute any combinational 
function. (The need for clearing locations within the program in the 
procedure discussed earlier can be eliminated by using a sufficient 
number of temporary locations.) We shall now consider some com- 
putationally complete instruction types that may be realizable by 
bubble interactions. 

Consider an instruction e = (2, y)a, defined by 


X° = X — {y} ifz,yeX 
=(X —{x/)U fy} ifaeX,yegX 
= X otherwise. 


If the presence or absence of a bubble in a location is treated as the 
value of a binary variable, and if the value of a variable after the 
application of an instruction (x, y)¢ is denoted by the corresponding 
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primed variable, the instruction can be represented by the following 
equations: 


ve =azy and y’=2xQy 
where @ represents exclusive-or. 

We prove that the instruction (z, y), is computationally complete 
by showing that we can perform duplication and also realize the NAND 
function using only instructions of this type. It is well known that any 
combinational function can be realized using only two-input NAND 
gates if duplication (fan-out) is allowed. Denoting locations that initially 
contain bubbles by lower-case letters with the subscript 1, the NAND 
of two variables x and y can be realized by the sequence of instructions 
(x, y)a(z, Q1)¢ ; which results in, x’ = zy,y’ = «Q@y, af =1Q@ zy = zy. 
ae can be performed by the sequence (2, b,)2(b; , ¢1)a , resulting 
inv’ = =1@2=4,c =1Qé = z. Thus all combinational 
Pave ee can ‘be computed without special encodings. 

A timing problem associated with this instruction* must Be ee 
Two applications of the instruction (x, y), results in zy(4 @ y) = 
in location z and zy @zx@y = «+ y in location y. Physically, ‘ 

= y = 1 when the instruction (2, y), is applied, the bubble in y is 
destroyed. If the field effecting the execution of the instruction is still 
applied, the bubble in location x is transferred to y. Thus, this instruc- 
tion requires application of the field for a fixed interval of time. We 
may say that the instruction is pulse-width dependent. For each of the 
other instruction types considered, application of the same instruction 
a second time does not result in any further change in either location. 

Another type of computationally complete instruction is the condi- 
tional transfert denoted by e = (a, y)z and defined by 


= (X — {x}) U ty} ifz,zeX,y¢X 
= X otherwise. 


Duplication can be performed by the instruction (a; , a)” where do 
and a, denote an empty location and a location with a bubble respec- 
tively. After the application of the instruction, locations a) and x will 
contain the variable x. Similarly, the sequence (2, y)b,, (b:1, 60), 


* In practice, there exists an interaction involving three locations 2, y, z, such 
that ify = 0, y = 2 Q@z2’ = 2’ = az. The timing problem mentioned above 
does not occur in this case. 

+ The interaction involving three locations, mentioned in the previous footnote, 
may be represented by the following sequence of conditional transfer instructions, 
if y = 0 initially: (a, y)z; (z, y)z. The order of execution of these two instructions 
is Immaterial. 
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yields the function zy in location b, . All combinational functions can 
be computed using only conditional bubble transfers without special 
encodings. 


III. TIME AND SPACE CONSIDERATIONS 


In the preceding section, we showed how any combinational function 
can be computed using magnetic bubble interactions, assuming that 
interactions are possible between any pair of bubble locations. Since 
interactions can occur only between bubbles in physically adjacent 
locations, this implies that bubbles that are required to interact are 
brought into adjacent locations prior to the application of the instruc- 
tion. The time taken for computing a function depends not only on the 
number of instructions in the program but also on the layout of the 
bubble locations. It is also necessary to be able to move a bubble from 
any arbitrary location to any other location without affecting the 
bubbles in other locations. We shall examine these problems in this 
section. 


3.1 Bounds on Memory Requirements 


Consider the layout shown in Fig. 1. If the locations in the shaded 
area are used as bubble locations for a program, the locations in the 
unshaded part can be used as transit points. These memory locations 
are initialized so as not to contain bubbles. Denoting the shaded and 
unshaded regions by S and T respectively, an instruction of the form 
(a, b) a, b e S will be replaced by a sequence of instructions 


(a, 2:)(Hs  Viar) +0 (Lia, Bi) (@; yO) (Hi, Zina) +++ (Wear , Bi) (Hi, @) 
where 
Ui» Vita, 9° Vj 


are adjacent locations in 7’ so as not to affect any other location in S. 
With this layout 3n/2 — 1 locations are required for n locations which 
are useful for computation. (We shall refer to these as data. points.) 
If p is the total number of locations, the maximum number of data 
points can be shown to approach 2p/3 asymptotically (K. C. Knowl- 
ton, private communication). 

The maximum path length (one-way) between two data points is 
n/2. The maximum path length can be greatly reduced at the expense 
of a slight increase in the required memory (i.e., total number of loca- 
tions) by the layout shown in Fig. 2a. The total number of locations 


MAGNETIC BUBBLE INTERACTIONS 1709 


WILLE 
7 /, 
MAL 


Fig. 1—Memory layout. 


is (n/2k — 3)-3k + 12k = 3n/2 + 3k, and the number of data points 
is 2:3h + (k — 1)-(n/k — 6) + 2(n/2k — 3) = n. The maximum 
path length (one-way) is n/2k + 3k — 1. [By changing the corners 
as shown in Fig. 2b, the maximum one-way path length can be reduced 
by 2, and the total number of locations can be reduced by 4 
(M. D. Mellroy, private communication).] The maximum path length 
is dependent on k. Since k must be an integer and 2k must divide n, 
the maximum path length is minimal when k = a/ n/6 if this is an 
integer. Thus if n = 6k” the maximum path length for the layout of 
Fig. 2a is (1/6 /n — 1). Some typical values of n and the minimum 
maximal path length for various values of k are shown below: 


k n path length 
2 24 11 
3 54 17 
4 96 23 
5 150 29 





Fig. 2—Memory layout to reduce maximum path length. 
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In order to obtain a lower bound on the maximum path length for 
a memory with 7 locations, allowing interactions between any pair 
of locations, consider the closest packing of these locations in two- 
dimensional space, which is a circle. If the radius of the circle is 1, 
wr = norr = V n/r. The maximum distance between two points 
is 2r = ~W/4n/r. Clearly, this bound cannot be attained because all 
points in the circle are treated as data points in deriving the bound. 
Thus the best we can hope to do is to obtain a maximum path length 
of C Wn where C > ~/4/z. The coding of Fig. 2 has maximum path 
length < +/6 Wn and hence requires approximately twice as much 
time as the lower bound. . 

The memory layouts discussed above allow interactions between 
all pairs of data points and are therefore suitable for realizing any 
arbitrary program. For realizing any specific function, interactions 
would be necessary only between a subset of these pairs of data points 
and consequently a smaller memory would suffice. The maximum path 
length can also be reduced in general. In this case, the memory require- 
ments and maximum path lengths derived above serve only as upper 
bounds. 


3.2 Number of Instructions for Realizing a Function 


The number of instructions and bubble locations required for realizing 

any given function using any given type(s) of instructions may vary 
over a wide range depending upon the realization, as shown in the 
following examples. 
Example 1: Consider the computation of f = x @ y, using bubble 
transfer instructions and bubble splitting, if necessary. Let two loca- 
tions v» and x, be used to represent each variable z, as discussed earlier. 
Using the canonical realization in the sum of products form discussed 
earlier, four minterms (two each for the function and its complement) 
have to be computed. Each minterm is computed using four bubble 
splitting instructions and one bubble transfer instruction. Finally, 
two bubble transfer instructions are required to compute the sum of 
two minterms. Thus the total number of instructions in the program 
willbe 5 X 4+ 2 X 2 = 24. The total number of data points required 
is 10 (4 for x and y, 2 for f, and 4 temporary storage). Using the layout 
of Fig. 1, 14 bubble locations are required in the memory. 

The following program containing only six instructions also com- 
putes this function, if fo and f; are both initially empty: 


P = (a1, Y1) (o , Yo) (% , fo) (Xo ’ fo) (yr » Yo) (Yn » fi). 
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Six data points are necessary, and these can be obtained using eight 
locations and the layout of Fig. 1. Since interaction between all pairs 
of data points is not required by the program, it can be shown that 
seven locations are necessary and sufficient to permit the required 
interactions. However, the following program which also contains six 
instructions and requires six data points can be realized with only six 
bubble locations: 


P' = (a, y1)(% , Yo) (21 , Lo) (Xo , fo) (Yo » 1) Yo ; fi). 


Figure 3 shows the memory layout for this program. 

This example shows that the canonical realization may be inefficient 
in the number of instructions, computation time, and also memory 
requirements. The inefficiency of the canonical realization becomes 
even more apparent if we compare the canonical realization of f = 
t, @ t2 @ --- @ a, with the realization in the form [(71 ® 2) @® 
xz; -+: @ 2,], where each exclusive-or operation is performed by a 
program of the type given above. 

The above example points out several open problems associated 
with the design of efficient programs for computing combinational 
functions. One problem is that of obtaining a program with the 
smallest number of instructions for computing a given function. For 
a given program, the assignment of locations so as to minimize the 
total number of locations required for its implementation is also an 
important problem. Since the number of memory locations required is 
dependent not only on the number of instructions, but also on the 
specific program, another open problem is that of obtaining a program 
with minimum memory requirements. 

The problem of minimizing the number of instructions in a program 
for computing a combinational function corresponds to that of finding 
a realization of the function using the smallest number of modules of 
given types. For example, the bubble transfer instruction can be repre- 
sented by the module shown in Fig. 4a. The bubble location at which 
each output appears is given in parentheses. Note that fan-out is not 





Fig. 3—Memory layout for program P’. 
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allowed in the realization. Bubble splitting is represented by the 
module of Fig. 4b. Instructions of the type (x, y)¢ and the conditional 
transfer (x, y)z can be represented by modules of Fig. 4c and d 
respectively. Creation and annihilation of bubbles can be represented 
by appropriate constant inputs (0 or 1). If a realization of the func- 
tion can be obtained using a subset of these module types, then the 
realization can be readily translated into a program which computes 
the function. All bubble locations used in the program will correspond 
to inputs of the circuit. Although efficient realizations of functions 
using only modules representing the allowed bubble interactions lead 
directly to efficient bubble programs, there are at present no known 
algorithms for the former. The following example shows how a pro- 
gram is obtained from a modular realization of a function. 

Example 2: Consider the function f = aé + bed + béd, which is to 
be computed using the bubble transfer instruction only. Let each 
variable be represented by two locations. Thus, we wish to realize f 
and f using only modules of Fig. 4a, with no fan-out. The inputs to 
the circuit consist of the variables and their complements. f = 
cd + be + déd + abé = c(b + d) + aé(b + d). We obtain the circuit 
of Fig. 5 (by trial) which realizes both f and f. Translating each module 
of Fig. 5 to the corresponding instruction, we obtain the following 
program: 


P = (bo , do) (do , 1) (01 , di) (Go ; Co) (di , Go) (do , di) 
(c, , bi) (Bo , Go) (Co , G1) (Bo , Co) (C1 5 Co). 


At the end of the program f and f will be in cy and d, respectively. 
In translating the circuit to the program, no module should be trans- 
lated to the corresponding instruction until the modules connected 


(x) (x) 


x xy x x 
y (y) 2+y y (y) +y 
(a) (b) 
(x) (x) = 
x xy x LY + LZ 
(y) y a X2+Y 
y x@Y Z (Z Z 
(c) (d) 


Fig. 4—Modules representing instructions: (a) Bubble transfer, (2, y). (b) 
Bubble splitting, (z, y)s. (c) Type (2, y)a. (d) Conditional transfer (z, y)z. 
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Ql oi 


O1 ol 





Fig. 5—Circuit for example 2. 


to its input terminals have been translated. It is also important to 
observe the proper correspondence between bubble locations and the 
output terminals of a module. After applying an instruction (2, y), 
zy appears at location x, and x + y appears at location y. 

The other types of modules shown in Fig. 4 can be used in a sim- 
ilar manner for deriving programs using other types of instructions. 
However, there are no known algorithms for realizing any arbitrary 
function using only modules of a given type, so as to minimize the 
number of modules used. When such algorithms become available they 
can also be used for obtaining efficient programs of magnetic bubble 
interactions. 


3.3 Speed-up of Computations 


We have seen how a combinational function can be computed by 
different programs having different numbers of instructions and 
memory requirements. If the instructions in a program are executed 
one at a time, as was implicitly assumed so far, then the time for 
computing a function depends on the number of instructions in the 
program. However, it may be possible to speed up the computation 
by executing several instructions simultaneously. 

An obvious method of obtaining parellelism in a program is by 
executing independent instructions simultaneously. Instructions that 
may be executed simultaneously can be determined from the circuit 
corresponding to the program, discussed in the preceding section. The 
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modules in the circuit are arranged according to levels as follows: 
‘Modules whose inputs are only input variables (or their complements) 
are assigned to level 1. The level of any module is defined to be the 
smallest integer greater than the levels of the modules connected to its 
input terminals. With levels assigned to all modules in the circuit in 
this manner, the instructions corresponding to these modules are 
executed in the order of the level numbers, and the instructions cor- 
responding to all modules in one level may be executed simultaneously. 
However this may necessitate a further restriction on a valid memory 
allocation scheme. 

Example 3: Consider the realization of the function f = aé + bcd + béd 
in Fig. 5. The modules can be arranged according to levels as follows: 
level 1-(1, 3, 4); level 2—(2, 5, 9); level 38-(6, 7, 8); level 4-(10); level 
5-(11). The program P can be executed in five steps as follows: 


(7) (bo , do) (b1 , di) (Go , Co). 
(it) (do , €1) (di , Go) (Co , G1). 
(tz) (do , d1) (€1 , b1) (Bo , Qo). 
(iv) (Bo , Co). 

(v) (C1 , Co). 


Another situation where simultaneous execution is possible becomes 
obvious by considering the instructions (a, b)(a, c). If these instructions 
are executed simultaneously, the location a will contain abc, whereas 
the contents of locations b and ¢ are indeterminate because they depend 
on which of the two instructions is actually executed first. However, 
if we are interested only in the product term abc, the two instructions 
may be executed simultaneously. In general, all (n — 1) instructions 
for forming a product of n variables can be executed simultaneously. 
Similarly, the simultaneous execution of the instructions (a, s) (b, s) ---, 
leaves the sum a + 6 + --- +. 8 in location s with the contents of 
a, b, --+ indeterminate. Using this technique, the program of Example 3 
can be executed in four steps by replacing the instructions of steps 
iv and v by (bo , Co) (C, , Co), executed simultaneously. Instructions of 
the type (x, y)z can also be executed simultaneously with similar results. 
For instance, the simultaneous execution of (a, b)z and (a, c)z leaves 
the product abc in location a, with the contents of locations b and c¢ 
indeterminate. Similarly, if (a, c), and (b, c)z are executed simul- 
taneously, the location c will contain a @ b @ c, but the contents of 
locations a and b will be indeterminate. 
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It may also be possible to speed up computations in a manner sim- 
ilar to the use of completion signals in asynchronous circuits.” A 
combinational function f can be expressed as a sum of disjoint 
minterms. The function can be computed by computing these min- 
terms, one at a time. As soon as a 1 is generated, the computation is, 
in effect, complete because the value of the function is known to be 1. 
The computation can be terminated if the presence of a bubble in 
the output location can be detected by the control circuitry. The con- 
ditional transfer instruction (x, y)z discussed earlier may be useful 
for this purpose. 


3.4 Iterative Array Realizations 


Our discussions so far have been restricted to realizations which 
minimize the number of instructions or the computation time. In this 
section, we shall consider some techniques for obtaining regular struc- 
tures, which are capable of performing computations by the applica- 
tion of uniform magnetic fields. In such a structure, the same type 
of instruction is executed simultaneously in all parts by the applica- 
tion of a uniform magnetic field, the type of instruction being con- 
trolled possibly by the direction of the magnetic field. 

Since regularity of interconnections is a highly desirable feature in 
integrated circuits also, a great deal of effort has been directed toward 
the realization of functions as iterative arrays.*® Such arrays may 
be classified as uniform cell function arrays (in which the function 
realized by each cell is identical and different functions are realized 
by the array by changing the inputs to some cells) and nonuniform 
cell function arrays in which different cells realize different functions, 
depending on the position of the cell in the array. The operation of 
uniform cell function arrays can be simulated by magnetic bubble 
interactions with uniform magnetic fields. 

The rectangular array shown in Fig. 6a may be used for realizing 
any combinational function of n variables by merely changing the 
connections to the last row of cells. A typical cell in the array is 
shown in Fig. 6b. It can be shown that the vertical outputs of the n™ 
row consist of the 2” minterms of 7 variables. Since at most one of 
the minterms will be 1 at any time, the last row will compute the 
sum (oR) of all minterms connected to it. Any function can be realized 
by connecting to the cells in the last row only those vertical outputs 
of the n™ row that correspond to 1-points of the function. 

Note that the number of cells in the array is (n + 1)2”. The longest 
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Fig. 6—(a) Rectangular array. (b) Typical cell in array. 


path in the array is 2” + n + 1. Therefore, the time required for 
computing a function of n variables is proportional to 27 + n + 1. 
The operation of this array can be simulated by magnetic bubble 
interactions as follows: Let each cell in the array be replaced by two 
bubble locations as shown in Fig. 7. Let the application of a magnetic 
field to a cell cause the execution of the instruction (z, y), and transfer 
of the resultant bubbles as shown by the arrows in Fig. 7. (It may be 
necessary to apply a magnetic field in one direction followed by a field 
in a different direction to accomplish this.) Let the cells in the 7th row 
and jth column be denoted by C;; . If the left and right locations in 
C;; represent x and y respectively, application of the field to C;; results 
in zy in the right location of C;,1,; and x @ y in the left location of 
C;.;41 - Initially, we set the left locations of all cells in the first column, 
except the cell in the last row, to correspond to the values of the n 
variables. The left location of the first cell in the last row should be 
empty. The right locations of all cells in the first row initially contain 
bubbles. Paths from the nth row to the (n + 1)st row which correspond 
to minterms for which the function is 0 are inhibited. At ¢ = 1, the 
field is applied to Ci, . At ¢ = 2 the field is applied to Cy. and C’, ; 
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at t = 3 to Ci, , Co, and C3,; and so on. Thatis,att =k, lSks 
2” + n, the field is applied to all cells C;; such that 7 + 7 = k + 1. At 
t = 2” +n +1, the value of the function will be in the right location 
of Crt ,2" . 

Instead of applying fields to successive diagonals, the fields may be 
applied to the entire platelet. Now, the right locations of all cells in 
the top row should be connected to “bubble generators,” so that they 
always contain bubbles. The values of the n variables, contained in 
the left locations in the first column, may be changed every n. units 
of time. The output corresponding to any input combination will 
appear at the output location 2” + n + 1 units of time after the 
application of the input combination. If input changes occur n units 
of time apart, the correct outputs will also appear 7» units of time 
apart. However, the contents of the output location will not be correct 
between these fixed instants of time. Therefore, it is necessary to read 
the output at the appropriate instants of time. 

The preceding discussion shows how a uniform cell function iterative 
array can be simulated by magnetic bubbles by applying identical 
sequences of instructions to successive sets of bubble locations. This 
provides us with a systematic way of realizing all combinational func- 
tions. If conditional operations are permitted, nonuniform cell func- 
tion arrays can also be simulated. Such arrays can be made smaller 
than uniform cell function arrays. The number of cells required for 
realizing any arbitrary function of n variables is proportional to 2° 
with nonuniform arrays compared to n-2" required for uniform cell 
arrays.® However, the cells of the nonuniform array are likely to be 
more complex than those in a uniform array. 

It is interesting to compare the size of array realizations to the size 
of realizations without any restrictions on the interconnection struc- 
ture. Bounds on the latter will provide us with an indication of the 
space and time requirements for computing an arbitrary combina- 





Fig. 7—Simulation of array by magnetic bubble interactions. 
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tional function with magnetic bubbles. D. E. Muller® has shown that 
for any set of elements capable of realizing all combinational func- 
tions, the number of elements N for realizing any arbitrary function 
of n variables satisfies the inequality 


<v so 
Nn 


a 
n 

where C, and C2 are constants whose values depend on the set of 
elements used. The proof of the upper bound is constructive and uti- 
lizes only one type of cell. It is outlined below because of its possible 
applicability to magnetic bubble computations. 

The cell used realizes a function of three inputs ab + dc. Any function 
of k variables can be expressed as 


[G54 tas as ¢ ay) = arf (r1 5 Xe, vce sidpeaey kD 
+ Hf (21, 2 , iat , Le-1 , 0). 


If all functions of & — 1 variables are available, every function of k 
variables can be realized using exactly one cell. There are 2” functions 
of k variables (which also include all functions of k — 1 variables, 
k — 2 variables, etc). Thus 2” cells are sufficient for realizing all func- 
tions of k variables. 

By expanding about any variable, the output function can be formed 
from two functions of n — 1 variables. Repeating the procedure, it 
may be formed from 2” functions of k variables, using 2” * — 1 ele- 
ments. Since 2” elements are sufficient for realizing all functions of 
k, variables, 


N = 2% +97" — 1, 


The value of k may now be chosen so as to minimize N. An approximate 
minimum is N < (,(2"/n), for some constant C, . Thus the bound 
on the number of cells required is greatly reduced for nonregular 
structures. 


IV. SUMMARY 


Three different mathematical models of magnetic bubble interac- 
tions were studied. and each of them was shown to be sufficient for 
computing all combinational functions. Some methods of speeding up 
computations were presented. Geometrical patterns in which it is 
possible to move bubbles between any pair of data locations and also 
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minimize the maximum path length were examined. A uniform struc- 
ture for computing arbitrary functions was also presented. These 
structures have the advantage that computations can be carried out 
by the application of uniform magnetic fields to the entire platelet. 
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A Nonlinear Diffusion Analysis of 


Charge-Coupled-Device Transfer 


By R. J. STRAIN and N. L. SCHRYER 
(Manuscript received January 25, 1971) 


In those charge-coupled devices (CCD’s) which have regions under 
each electrode which are substantially free of externally applied 
tangential electric fields, charge motion takes place by space charge 
assisted diffusion, and this relatively slow process represents a limita- 
tion to the operating frequency of CCD’s with large plates. In this 
paper, subject to certain approximations, the equations of motion for 
CCD charge transfer have been derwed, yielding a nonlinear diffusion 
equation. The solution of this equation by a stable finite difference 
scheme is described, and the solutions are applied to predicting the 
operating characteristics of CCD’s. The results in synopsis are: 


(1) The n-channel devices will have lower losses at a given frequency 
than the p-channel devices, and a higher upper frequency limit. 

(it) Higher amplitude signals (more charge) yield lower losses. 

(21t) Losses can be reduced an order of magnitude by using zZBRO’s 
which carry a substantial amount of charge. For example, with 
2-MHz clocking of a 25-ym plate (pad), p-channel 2-phase 
device, a 2-volt ONE 1s diminished by 4 percent per transfer with 
empty ZERO’s, but with 1-volt zERO’s, the diminution ts 0.26 
percent per transfer. 

(w) Reasonably efficient CCD operation should be possible up to the 
50-M Hz range using contemporary design tolerances. 

(v) Diffusion is important for reaching high transfer efficiencies. 

The frequency limitations described in this paper can be overcome 

by using a structure in which the distance between. the electrodes 
and transferring charge is comparable to the electrode width and 
spacing. 


I. INTRODUCTION 


The charge-coupled device, as conceived by Boyle and Smith? and 
realized by Amelio, Tompsett, and Smith,?* consists of a series of 
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metal oxide semiconductor (MOS) capacitors that are driven by clock 
pulses into deep depletion. In this condition, the MOS structures are 
capable of holding minority carriers in the potential wells beneath 
the plates (pads). In order to establish a transfer of charge from one 
plate to another, it is necessary to make the potential well beneath 
the second plate deeper than that beneath the first. This is illustrated 
in Fig. 1. In the absence of charge, the potential configuration beneath 
any one plate would be essentially flat, and finite electric fields would 
exist only in that span between two CCD electrodes. When charge is 
present, its transfer is driven predominately by the electrostatic forces 
associated with the presence of the charge in the CCD and by the 
thermal forces responsible for diffusion. It is the purpose of this paper 
to describe the transport of charge under the influence of these forces 
and, using this description, to make predictions concerning the opera- 
tion of CCD’s. In order to do this, it will be necessary first to derive 
the equation governing the transport of charge and solve it. Then the 
solutions will be applied to some particular CCD situations. 


II. DERIVATION OF TRANSPORT EQUATION 


The derivation of the transport equation will utilize a number of 
approximations. The first of these is a linear approximation of 
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__ Fig. 1—A diagram of a CCD cross section and surface potential diagrams 
illustrating charge storage and charge transfer in CCD’s. 
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the surface potential versus applied voltage relationship. The one- 
dimensional relationship between surface potential g¢ and applied 
voltage V4 is described by this equation: 


Via Vo WE Ve" 





4) —4 
Vis Va- Ve =o | 
(1) 
Vo = deN p = : 
€or 
Coz s a 


In this equation Vrg is the MOS flatband voltage, and q is the surface 
density of mobile charge stored at the interface between the silicon 
and silicon dioxide. The constant Vy depends upon the oxide thickness 
§, the doping density Np, the dielectric constant ¢,, of the oxide, and 
the dielectric constant ¢, of the silicon. In the subseqeunt calculations, 
however, an approximation to this relationship will be used which is 
linear in Vy and q. This approximation is 


= + AV — Vrs + A). (2) 
In this case y is a number, typically between 0.7 and 1, which matches 
equation (2) to equation (1) over the range of anticipated operation. 
Such a match is shown in the example of Fig. 2, which shows the sur- 
face potential versus applied voltage reduced by the flatband voltage 
and. q/C,, both in the precise form and in the linear approximation. 
It may be seen in this case that the linear approximation is rather 
good over the range from approximately 4 to 15 volts. If the electric 
field parallel to the Si-SiO., interface is calculated from the approxi- 
mate equation, it is found to depend upon the derivative of g. There 
is, however, another term to the tangential electric field; this term 
arises from the. electrostatic repulsion of the carriers. In a two- 
dimensional approximation, the tangential electric field at x will have a 
component due to the summation of the fields of line charges located 
at other points x’ along the surface. Because there is a metallic elec- 
trode on the surface of the SiO», each elemental line charge has an 
image, and the result is a shielding of the tangential field. Taking 
account of the two different dielectrics and the image charge, the 
repulsion field Ez can be written as this integral: 
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Fig. 2—The exact variation of deep depletion surface potential i in an MOS with 
1000A of SiO. over a substrate with 5 x 1014 ionized impurities per cm3, and 
the linear approximation @ « yV’. 





Bs (t) = wee +6) f ae oo - 5A dex’. (3) 


Since the principal contribution to this integral comes from the small 
region within a few 8 of z, it is appropriate to expand q(2’) as a 
Taylor series about x. If this is done, and the limits of integration 
are extended over all x’, all even terms in the expansion vanish by 
symmetry, and to the extent that 0°q/dx*® and higher derivatives are 
negligible, the repulsion field becomes a local function of dq/dzx. 


25 06 


E;(x) = Tea ae (4) 


If one takes the gradient of equation (2) and adds the repulsion field 
in equation (4), the total electric field acting on the charge under the 
CCD plate is given by 
—~ 9 IY 
£, = —S ae 


(5) 
_ (v8 —2_}, 
oF (2 + Cr €og 
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Typical values of both y and the net elastance S are presented in Figs. 
3 and 4 as functions of the oxide thickness 8. Ordinarily S is approxi- 
mately equivalent to the reciprocal of the oxide capacitance; conse- 
quently, Fig. 4 has been plotted in such a way that the value of S is 
compared with the oxide capacitance. 

The transport equation for CCDs will be based on the divergence 
equation for current 


sf = -V-d, (6) 
using the relationship 
J = qE -— DS (7) 


for the value of the current density J. In this equation D is the 
diffusion coefficient appropriate to the surface, and p is the surface 
mobility. Combining equations (5), (6), and (7) gives the basic equa- 
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Fig. 3—The proportionality constant y as a function of oxide thickness, using 
doping density as a parameter. 


1726 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


1.2 


-1 


Np =5x 10 em-3 


1.0 2x10%cm-3 


0.9 


0.8 


Np = 5 x10'4 cm=? 


NORMALIZED SURFACE CAPACITANCE (SCoz ) 





ie) 1000 2000 3000 4000 5000 
OXIDE THICKNESS IN ANGSTROMS 


Fig. 4—The effective surface capacitance, 1/S (the reciprocal of elastance S) 
normalized by Coz and presented as a function of oxide thickness. 


tion for charge transport in a CCD: 


aq _ (22) Hq 7 a. 
at = pS Ox + quis ax” = D aa" (8) 
This may be simplified somewhat by noting the Einstein relationship 
between diffusion ‘and mobility. | 


D= uh. (9) 


Here «7 and e all assume their traditional values as Boltzmann’s 
constant, absolute temperature, and electronic charge. Using this sub- 
stitution, one reaches this equation: 


KT dq. 


ag _ (22) aq 
at BS ox uO ax” (10) 


The last term on the right of equation (10) is simply the diffusion 
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term. The resemblance between this diffusion term and the other term 
involving the second partial of charge is quite apparent, because the 
product Sq, like «7'/e, is a voltage. 

Without boundary conditions, the problem is only half specified. 
The physical situation on which the boundary conditions are based is 
shown in Fig. 5. In Fig. 5a, a CCD plate is shown storing charge. The 
potential beneath the plate and the charge density are both uniform. 
In Fig. 5b, another potential well has been impressed immediately 
adjacent to the first potential well. The second well is considerably 
deeper than the first; as a consequence, charge will start to flow from 
the first potential well into the second under the control of equation 
(8). No charge, however, flows in the opposite direction. Between the 
two potential wells there is an abrupt step in potential. This means 
that there is an extremely high electric field, and charge in that 
vicinity will move with a very high velocity. Using the preceding facts, 
it is possible to approximate the physical boundary conditions in the 
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Fig. 5—An illustration of the length definition and the surface potential con- 
figuration in equilibrium, and at ¢ = 0, when the transfer is initiated. 
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following way. 


0g 
ap te as x—0 (11) 


q=0 at «=D. 

Both of these boundary conditions apply for all time. The first ex- 
presses the fact that current is unable to flow to the left in the device, 
and the second presumes either an ideal sink for charge at x = JL, or 
alternatively that charge moves away from that point at an infinite 
velocity. The initial condition is a value of q at time ¢ = O for all 
points except x = L; it will be assumed that the charge is evenly 
distributed with the magnitude qy. 

Before proceeding with the solution to this equation, it will be con- 
venient to scale the variables. The scaling can be effected by applying 
the following definitions: 


t t 

T= LP = s (12a) 
KTo 

Uy = 1 volt 
x 

yey (12b) 
S 

u= a (12c) 


Time is now represented by the variable 7, which scales time against 
t, and y is the dimensionless variable representing length. In the 
definitions of 7 and 79 , the voltage unit wo is introduced. If this equa- 
_tion had a natural voltage unit, wo would be defined as that unit. 
However, there is no natural voltage scale to equation (10), even 
though the thermal voltage «7’/e appears in one term; consequently, 
it has proven convenient to define w to be unity. The quantity wu 
representing charge comes about because the product of the elastances 
and charge q is equivalent to a voltage. In the solutions which follow 
7 will typically range from 10 to 107, y will range from 0 to 1, and u 
will range from 10° to 10. 


III. SOLUTION OF THE EQUATION 
The scaled equation has the form 
u, =u, + (ut au, (13) 
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where a = x7'/eu is a constant. The problem may be simplified by 
letting w = u + a to obtain 


WU, = W, + w,,W (14) 
Syl 
Srso 
subject to 
w,(0, 7) = 0 
w(l,7) =a 
; < 
veg = fe ee (15) 
a, y=l 
i, =» De 
Uo 


where wu; is determined by the initial uniform charge density qo. 

The nonlinear parabolic initial boundary value problem given by 
equation (14) and equation (15) may be solved using a finite differ- 
ence scheme. The scheme comes from a more general technique for 
solving systems of coupled nonlinear parabolic equations which results 
in linear difference equations. This more general scheme will be pub- 
lished at a later date. 


Let h be the space mesh size and Ar be the time step. Let w; = 
w(jh, kAr). The indices j and k describe position and time respectively. 
The difference equation is then: 


wit! — ut 
Ar 


k+1 k+l ok k k k k 
1 he Wye) War  Wi-1 nei Wye1 + Wj-1 — 20; 
7 


9 Oh Oh —e he 


k k k+1 k+1 k+i k+1 
Wi41 — Wj-1 Wjai T Wj-1 x Wis tw 


j-1 au 
oh 5 ae a! he » (16) 


forj = 1,---, J — 1, whereh = 1/J. The boundary conditions reduce 
to wh*? = wi*! and wt! = a. 

This is a direct generalization of the Crank-Nicholson scheme for 
solving the heat equation, w; = w,,. For a description of that scheme 
and its properties, see Ref. 4, pp. 185-193. 


The most important properties of (16) come from the time and 
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space averaging which is the hallmark of the Crank-Nicholson scheme. 
The left-hand side of (16) is close to the value of w:(jh, (Kk + 1/2) At) 
and the right-hand side of (16) has been averaged to be close to 


(w,(jh, (& + 1/2)A1))* + w(gh, (& + 1/2)At)w,,(jh, (b + 1/2) At). 


The averaging has been done in such a way as to make wit] , w)*} 
and wi** appear linearly. This averaging gives (16) two nice properties. 
First, if a solution of (14) is inserted in (16), and everything is expanded 
in a Taylor series about (jh, (k + 1/2)At), then equality of the right- 
and left-hand sides follows up to terms of order 0(h”) and 0(At’). That 
is, the scheme is accurate to second-order in both time and space. 
In practice, this means that the difference between the computed 
w; and w(jh, kAt), the true solution at the mesh points, will be 0(h’) 
+ O(At’). Second, (16) is a linear system of equations for the w**?. 
In fact, equation (16) is equivalent to 


A A 
witt(—45 ah. — wh) — 45 ut) 


+ wie = a (Wis. + Wier — 2w;) + a vi) 


+ uttt(25 has — wha) — Sout) = wh. 
This has the form 
A ;Wys1 af Byw; sis Cywy-1 = D; j = 1, 2, a J — 1, (18) 


with w; = a and Wo = Wj, reflecting the boundary conditions. This is 
a tridiagonal system of equations for the wy and may be solved quite 
efficiently using Gaussian elimination (see Ref. 4, pp. 198-201). The 
method is easily described. Let 


Ww; = Bywjar + PF; (19) 


forj7 = 0,1, --: , J — 1. Replacing w;_1 by E;.w + F;_,1 in (18) 
gives wj;41 in terms of w;. Comparing this relation with (19) gives 


Po ee 

; B; ois C;E;-1 (20) 
F.= D; = CF 3-1 

B+ CE js 


forj = 0,--: ,J — 1. Since wy = wo = How, + Fo we see that Hy = 1 
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and Fy = 0. This completely determines wo, ++: , wy using (20) for 
j=1,-:-,J/—1, and then (19) forj = J —1,--- , 0. 


IV. RESULTS AND INTERPRETATION 


In applying the results of this analysis to CCD operation, the most 
important operating characteristic is the amount of charge remaining 
behind after transfer from one plate to the next in a limited time. 
The remaining charge is plotted as a function of time in Fig. 6, using 
two different initial conditions, Sgo = 10 volts and Sqo = 2 volts. The 
nonlinear equation which has been solved reduces essentially to the 
diffusion equation after most of the initial q has been dissipated, and 
the solutions in this domain approach straight lines of log(charge) 
versus time when the average value of Sq is much less than xT’/e. 
However, at short time 7 < 1 the solution is effectively driven by the 
charge so the time rate of change for Sqo = 10 volts is roughly five 
times the time rate of change associated with Sqg = 2 volts. Between 
these. two domains is a transition region where the average value of Sq 
lies between 0.2 «7’/e and 5 «T'/e. Here the curves are very nearly 
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Fig. 6—The fraction of the charge remaining under the first plate as a function 
of the normalized time r = t/7ro. The abscissa is plotted as 71/2, which spreads 
the region where + < 1, and leads to straight lines over the transition portion of 
the characteristic. 
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straight lines of log(charge) versus 7?. The different regimes reflect 
a feature mentioned when the equation was scaled; i.e., there is no 
single natural potential to use in forming the time scale. For very 
short times, the natural potential is the initial value of Sq. For times 
much greater than one, the natural potential is «7’/e. Figure 7 shows 
the transfer of charge with and without the aid of diffusion. It is clear 
that the last 0.1 volt of scaled charge receives a considerable boost 
from xT’/e. 

In Fig. 6 it can be seen that in the vicinity of 7 = 10 the charge 
remaining diminishes to approximately 1 percent of its initial value, 
and in the vicinity of 7 = 30 the charge diminishes to 0.1 percent. 
These results are meaningless without some concept of the size of 7. 
Figure 8 helps to alleviate this shortcoming by showing 7» as a func- 
tion of the plate width L for n-channel and p-channel devices. The 
n-channel mobility has been taken to be 750 and the p-channel mo- 
bility 150 cm?/volt-second.® From Fig. 8 it is possible to see that the 
values of 7» range from approximately 1 ns to 1 us for plates ranging 
from 10 to 100 um in width. To take a specific example, a 50-m_ plate 
in a p-channel device would have a 7» of one-sixth of a microsecond. 
If operation were desired at a value of 7 = 10, aiming for a transfer 
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Fig. 7—The fraction of the charge remaining for initial values of Sq. = 1 volt 
with room temperature diffusion and no diffusion («7’/e = 0). 
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Fig. 8—A plot of the time scale factor 7 as a function of the CCD plate 
length L. 


efficiency near 99 percent, then the operating frequency of a 2-phase 
CCD would be roughly 300 kHz, and the same loss per transfer 
would be achieved in a 3-phase CCD at 200 kHz. In an n-channel 
device with the same loss, those frequencies would increase to 1.5 and 
1 MHz respectively. 

In order to illustrate the effect of frequency, another example is 
shown in Fig. 9, where the loss per transfer is plotted as a function 
of clock frequency for a p-channel 2-phase CCD operating with square 
waves. In this type of operation, higher signal levels result in higher 
transfer efficiencies. The plate width in this case was chosen to be 
25 wm, and at 1 MHz the loss per transfer ranges from about 0.3 
percent for Sq) = 10 to 1.5 percent for Sqy = 2 volts. 

The figures quoted so far must be regarded as worst-case loss 
figures. They apply to an isolated cluster of charge being propagated 
through a CCD. It will be possible to reduce this loss by approxi- 
mately an order of magnitude by arranging the information propa- 
gated through the device in such a way that no information ever 
leads to a totally empty CCD cell. This will be effective, because for 
long times, i.e., 7 > 10, the charge remaining behind is substantially 
independent of the amount of charge initially put under a pad. This 
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Fig. 9—The fractional loss per transfer of a P-channel 2-phase CCD, with a 
field- kes length L of 25 um, plotted against clock frequency with the initial onz 
signal as a parameter. The zero’s have no charge. - 


is illustrated by Fig. 10, which shows the absolute magnitude of 
remaining charge as a function of the charge which was initially 
placed in the device. In this figure, the amount of charge is described 
as its voltage equivalent Sq. The impact on device loss is illustrated 
in Fig. 11, where the loss per transfer for the same device shown in 
Fig. 8 is plotted as a function of the charge existing in a ZERO when a 
ONE contains 2 volts. It is presumed that the information carried is an 
alternating series of ONE’s and zErRo’s. This represents a worst-case 
condition when nonempty or “fat” zmRo’s are used. From this figure, it 
is readily seen that a 2 to 1 ONE to ZERO ratio leads to practically an 
order of magnitude reduction in loss at a given frequency. Operation 
in this way requires the use of a threshold detector to ascertain 
whether a ONE or a ZERO was present; the success of such operation 
depends in large part upon the realization of a practical threshold 

detector. | . 

- The solution of the nonlinear diffusion equation also gives charge 
distributions in the CCD as a function of time. The charge distribu- 
tion at selected times is shown in Fig. 12. Initially the left-hand end 
is almost totally undisturbed by the presence of an accepting well at 
the right- nme end of the CCD, but then as charge transfers out of 
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Fig. 10—The remaining charge at several times after the initiation of transfer 
presented as a function of the initial charge Sqo. 
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Fig. 11—The fractional diminution of a one as a function of the amplitude of 
a “fat” ZERO in an alternating series of ONH’s and zERo’s. This example also refers 
to a 25-um 2-phase p-channel CCD. 
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the device, the left end follows, but the profile which is established 
relatively early in the device cycle remains at least superficially un- 
changed through the course of the transfer. 

If the potential of the source well is being changed at the same time 
the transfer is taking place, it is valuable to know the time develop- 
ment of the highest charge density in the source well, because it is 
necessary to prevent this point from ever reaching an injection con- 
dition. In this approximation injection occurs when Sq > y(Va — 
Vrg), and the well is no longer capable of holding the charge; the well 
and the substrate then form a forward biased p-n junction. For the 
example discussed before, the 25-4m p-channel CCD, the highest volt- 
age in the well has been plotted as a function of time, assuming initial 
charges of 10 volts, 2 volts, and 1 volt; Fig. 13 shows the real time 
for the specified device and the general, normalized time. This figure 
shows that the potential on the plate must not drop to zero in a time 
less than 100 ns, if the injection of stored CCD charge into the bulk 
of the semiconductor is to be avoided. 

It is unlikely that the initial charge configuration in a CCD will 
be perfectly square as assumed. To test the effect of this assumption 
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Fig. 12—The charge density as a function of position under the CCD plate, 
and the parameter is the time after initiation of transfer. The percentages refer 
to the total fraction of charge remaining under the plate. 
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Fig. 13—The amplitude of the largest surface potential difference Sq(0) as a 
function of time. Two time-axes have been provided, the generalized 7, and 
t for a 25-um p-channel device. 


on the results presented here, two initial conditions were used; one 
had charge distributed uniformly to an Sq value of 8 volts, and the 
other had charge distributed nonuniformly but having the same average 
charge density, that is Sq = 8 volts. Figure 14 shows the difference 
in the charge remaining in the potential well as a function of time for 
these two distributions, and it is easily seen that beyond the time of 
rt = 0.25 the two charge quantities are substantially identical. 

The effect of finite carrier velocity at the transfer point was also 
tested. If infinite velocity cannot be achieved, the charge density at 
the point L cannot be 0, but it must be that finite value which will 
allow current to flow out of the potential well at the saturation 
velocity. Using this fact, it is possible to write the charge density 
at L as a function of the time rate of change of the total charge in 
the well dQ/dt. 


L dt 
v 


u(L) = Sq(L) = + = =F. 21) 
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Here v, is a saturation velocity and, assuming its value to be 10’ centi- 
meters per volt-second, the solution of equation (10) was reprogramed 
to reflect a value u(L) in accordance with equation (21). The dif- 
ference caused in the result was negligible, but because of the L?/p 
time scaling, it was more significant for very small plates than for 
the larger plates. The time error is shown in Fig. 15 as a function of 
charge remaining beneath the plate. 


V. DISCUSSION 


We have developed an equation governing charge transfer in flat- 
plate CCD’s and shown how it can be solved numerically by an un- 
conditionally stable finite difference routine. These solutions have 
been applied to the calculation of the properties of CCD’s subject to 
certain constraints; the most serious of these limitations is the assump- 
tion of the instantaneous creation of the transferring condition. This 
assumption transforms into physical reality as the use of square- 
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Fig. 14—T wo initial conditions are compared to evaluate the impact of non- 
uniform distribution of a given total charge on the transfer predictions. The 
ordinate is the fractional difference in the total remaining charge and the 
abscissa is normalized time. 
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Fig. 15—An illustration of the effect of limiting the carrier velocity at 2 — L 
to 107 cm/s, expressed as an error between the time predicted in the idealized 
case and the time in the velocity-limited case. 


wave driving. Under these conditions, we have shown that with 
empty zERo’s losses of the order of 1 percent can be achieved in the 
vicinity of 1 MHz in p-channel CCD’s, and the same order of loss at 
5 MHz in n-channel CCD’s. Further it has been shown that the use 
of ‘fat’ zeRo’s can effectively reduce this loss by as much. as an order 
of magnitude. The problem which has not been solved here is’ that 
_ which arises when the accepting well is not created instantaneously but 
moves down with a finite time-rate of potential change. In this situa- 
tion, 2 ONE will have a longer time in which to transfer than a ZERO 
will. This may tend to reduce the amount of charge left behind by the 
ONE as contrasted with that left behind by a zzRo. This would diminish 
the loss. At this stage, it is evident that losses of realizable CCD’s, ice., 
those with plates in the range of 15 to 25 wm, will be modest in the 
5 to 10 MHz range, particularly if they are made using an n-channel 
technology. Using properly loaded one’s and zEro’s, short shift registers 
should deliver useful signals, even clocked at 50 MHz. 

This analysis has also shown that the existence of field-free regions 
in CCD’s represents a considerable limitation on their performance. 
It will be possible to further lower losses by arranging substantial 
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penetration of transverse electric fields under the plates so that the 
charge transfer is limited by drift rather than diffusion. In this case, 
the scattering limited velocity assumes an important role. 

CCD’s on 10 9-cm substrates appear, on the basis of preliminary 
measurements,** to behave according to the predictions of the space 
charge assisted diffusion theory. 
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Linearized Dispersion Relation and Green’s 
Function for Discrete-Charge-Transfer 
Devices with Incomplete Transfer 


By W. B. JOYCE and W. J. BERTRAM 
(Manuscript received February 19, 1971) 


A small-signal (linearized) theory of discrete-charge-transfer-device 
performance is presented for the case of incomplete charge transfer. 
Specifically, the dispersion relation is derived which relates the charge- 
transfer efficiencies presently characterizing these discrete (in space 
and time) devices to the usual measures of device or transmission-line 
performance based on the attenuation, dispersion, phase velocity, etc., 
of sine waves. Ina more general sense this emphasizes the applicability 
of conventional signal theory to these new devices. The impulse 
solution or Green’s function is then shown to be the equivalent of a 
bivariate distribution in probability theory. More generally the utility 
of (deterministically interpreted) probability theory 1s emphasized 
by showing the equivalence of a general small-signal theory to a 
random-walk process. 


I. INTRODUCTION 


In an important new class of discrete-charge-transfer devices includ- 
ing charge-coupled devices: (CCD’s), bucket-brigade shift registers,?-° 
and other shift-register or image-detection or display devices, ex- 
ternally applied time-dependent voltages step captive charge along 
a chain of equivalent discrete storage stations. In some of these 
devices the charge transfer is imperfect with a fraction of the charge 
failing to advance and a fraction lost altogether during each step. 
Explicit expressions are constructed here for the dispersion relations 
and Green’s functions which describe this imperfect performance 
under conditions when the fractions of charge that go astray can be 
described by constant parameters characteristic of the particular 
device. . 

The theory of analog signal processing is based on the properties 
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of sine waves, and analog devices are conventionally characterized 
by the dispersion, attenuation, etc., which they cause. A second 
standard method of characterization is based on the distortion and 
attenuation of pulses. In contrast, discrete-charge-transfer devices 
are presently characterized by their charge-transfer efficiencies 

(fractions). The solutions given here were chosen to facilitate the 
application of conventional signal processing theory to these new 
discrete devices. That is, they show the equivalence of these three 
methods of. characterization in the small-signal limit and provide the 
appropriate interrelating formulas. Specifically, Section III treats the 
sinusoidal representation, while Section IV considers pulses. 

. Although the physical interpretation of our equations is determinis- 
tic, it is also an objective in Section IV to show the direct applicability 
of many established results of probability theory. In the Appendix our 
basic equation is re-derived as the simplest nontrivial example of a 
discrete small-signal theory. In turn, the small-signal theory is seen 
to be the deterministic limit of a random-walk process. 

Whether the mode of device operation be digital or analog, one is 
ordinarily interested in utilizing the full information capability; 
i.e., maximum bandwidth or wavelengths approaching twice the sta- 
tion separation in shortness. Thus, we at no time approximate the 
discrete equation by a (continuous) differential equation. 

In summary our objective is to emphasize by way of a case of 
present interest that standard methods of device characterization, as 
well as established probability theory, can be applied to discrete 
charge-transfer devices. 


II. BASIC EQUATION 


Discrete-charge-transfer devices are often, to a good approximation 
if not exactly, discrete in both space and time. That is, the informa- 
tion-bearing charge is moved in discrete bursts in time from one 
spatially discrete storage station (e.g., capacitor or potential well) to 
the next along a line of stations. Consider gq, the charge in station 
a at time ¢ where x and ¢ assume only integer values; ie., the unit 
of time is taken as the stepping interval, and the unit of distance is 
taken as the station separation (center-to-center). Perfect charge 
transfer implies 


Q2,t = Qz-1,t-1 (1) 


and hence unit signal speed. 
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Real devices are typically experimentally characterized by the 
fraction a of the charge which is successfully advanced per step.** If 
a fraction ¢ fails to advance and remains in its original station, then 
the process is described by 


Qz.t = OQz-1,1-1 1 €Qz,1-15 (2) 
which reduces to equation (1) in the ideal case of a = 1 and« = 0. In 
other words, equation (2) states that the charge in station x at time f, 
Gut , 18 the successfully transferred fraction of the charge in the previous 
station at the previous time, a@z-1, +1, plus a fraction « of the charge 
Qx,t-1 1n station x at the previous time which failed to advance. From 
equation (2) it follows that a fraction @ = 1 — a — « of the charge is 
lost per step. Theoretical expressions for a and/or e are contained ex- 
plicitly or implicitly in the work of several authors on different de- 
vices.** [In the appendix, equation (2) is generalized slightly to 
include an inhomogeneous term and nonconstant values of a and «.| In 
this context equation (2) was introduced by Berglund to deseribe in- 
complete transfer in CCD and IGFET bucket-brigade shift registers, 
and he showed an approximate equivalence at low frequencies to an 
(analog) matched transmission line.* Traditionally equation (2) is 
associated with probability theory (Bernoulli trials) where a + « = 1.° 
In the case a = « = 1 (outside our primary interpretation) equation 
(2) is usually known as Pascal’s triangle although he was preceded by 
Cardan in 1540 who, in turn, cited earlier sources.11 We give here a 
number of exact and exact limiting (« — 0) solutions to equation (2) 
useful both in image-detection and shift-register contexts. It should be 
noted that our results can also be applied to Berglund’s model* of 
bipolar bucket-brigade shift registers which differs from equation (2) 
only through an interchange of the physical interpretation of x and ¢. 


II. SINUSOIDAL REPRESENTATION 


The dispersion relations can be found from the space and time 
Fourier transforms of equation (2) or by the separation-of-variables 
method. Either approach amounts to seeking a running-wave solution 
of the form 


Ime on? | x L, | t | SiO dre eee (3) 


where k and » may be complex to account for the attenuation, and ¢ is 
any constant phase factor. Since x takes on only integer values, equa- 
tion (3) is not affected by adding multiples of 27 to the real part of k, 
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and thus without loss of generality we require | Re k| S z=. In the 
theory of lattice vibrations (where time is continuous but the physical 
system is discrete) this range limitation on Re k is usually expressed by 
saying that the wavelength with one atomic species must be at least 
two lattice spacings.1® Similarly, since time is discrete, all frequencies 
outside the fundamental range | Re »| S 7 are redundant. In the 
theory of sampled-data control systems (where the physical system 
is usually continuous but time is discrete) this is usually expressed 
by saying that the sampling frequency must be at least twice the 
maximum frequency to be detected.1* (Here the sampling frequency f, 
is once per unit time, 1.e., f, = 1 or o, = 27). Substituting equation (3) 
into equation (2) yields for the dispersion relation between w and k 
e° = ae + ¢, (4) 

Two important special cases of equation (4) are discussed in the rest of 
this section. Since equation (4) shows that the dispersion relations are 
independent of », we take » = O hereafter. 

In the case of image detection (or projection), & is real correspond- 
ing to a term in the spatial Fourier representation of the initial image. 
Equation (4) then reduces to 


w(k) = w’ + tw!’ = —w*(—k) (5) 
og oe ei SE 
eas a cos k + a/e 


— i(In a + 4 In [1 + (€/a)* + 2(/a) cos k)); (6) 
i.e., the wave is attenuated in time but remains spatially sinusoidal. 
The identity 
__sin 0 smd 
c + cos 6 c+ cos0 
was used to express the effect of nonzero « as a separate correction 
term in equation (6). The real and imaginary parts, o’ and o” are 
plotted in Fig. 1. Although not considered here, it is sometimes useful 
to regard a and « as k-dependent (i.e., wavelength-dependent) quanti- 
ties. In the practical case of small «/a, equation (6) is usefully ap- 
proximated by truncating its expansion in powers of «/a after the 
linear term, 1.e., 


tan * + tan" 


w(k) > k — (e/a) sn k — i[lIna + (e/a) cos Kk], e/a > 0 
—>k — esink + z[€ + e(1 — cosk)], e, > 0. (7) 
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A phase velocity v{ can be defined which describes the apparent speed 
of the crests and troughs of the attenuating sine wave; i.e., 


sin k 





/ 
vi(k, e/a) = ae =1—k" tan” ee (8) 
ee ee e/a > 0. (9) 
a k 
In particular 
velk, e/a) + vg(k, a/e) = 1 (10) 
ve(k, 1) = 2 (11) 


vik, e/a) > 1 as koa if ela <l 
—>0 as kor if ¢efa>l. 
At long wavelengths (i.e., k > 0) wo’ ~ k while wo” — const ~ k?. Thus 


we ignore the attenuation and find an infinite-wavelength group 
velocity 


dw’ (0) 
dk 
In the case of shift-register operation » is real, corresponding to a 
term in the Fourier representation of a time-dependent signal intro- 
duced into one station. Equation (4) then reduces to 


= 4(0, e/a) = (1 + e/a)”. 


k@) = k! — tk’ = —k*(—o), OSwSr (12) 


sin w 
— cosw 


+ i[na—4m (1+ & — 2 cosw)] (13) 


(6) + tan * =] 
é€ 


—>w-+esnw+i(Ine + € cosa), e— 0 (14) 

—w+esinw — 27[6+ «(1 — coswa)], e,f— 0; 
i.e., the wave is attenuated as a function of x but is purely sinusoidal 
in time at a given x. The identity following equation (6) was again 
used. The quantity 7k corresponds to the propagation function of 
(continuous-time) lossy transmission lines; similarly k’ and k” corre- 
spond to the phase function and attenuation function respectively. 
When Fig. 1 is rotated through 180 degrees in its plane, it becomes a 
plot of equation (13). Although not considered here it is sometimes 
useful to regard a and ¢ as w-dependent quantities. 
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the (real) angular wave number k. After conversion from degrees to radians, k is 
measured in units of (27 times the stations spacing)-1. The curves are param- 
eterized by the ratio e/a where a@ is the fraction of the charge successfully. 
advanced per stepping operation and e is the fraction of the charge which remains 
in a station per step [From equation (6)]. 
Fig. la—The real part of w. (Note that w’(k, e/a) + w'(k, a/e) = k.) 
Fig. 1lb—The imaginary part of w. 

After rotating the figures by 180 degrees: For a sinusoidal signal of the form 
expli(wt — kz)] with |z|, |t] = 0,1, 2, --- (temporal sine wave) the complex 
angular wave number k = k’ — 7k” is plotted versus the (real) angular signal 
frequency w where w is measured in degrees per stepping interval. The curves 
are parameterized by e where 0 < e < 1 is the physically significant range 


[From equation (13)]. 


Fig. la—The real part of k. - 


Fig. tb—The imaginary part of k. 


The phase velocity is given by 


i -1 
Vo, ) = 7 = E + 7? tan” ane | 


COS w 
(15) 


sin w 





~1l-e ; e— 0. 
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k’’, and hence the attenuation, is frequency dependent for any « > 0 
(just as w” is k dependent for any e/a > 0); however, as one would 
expect*® from causality (Hilbert Transform; Kramers-Kronig Rela- 
tion) in a continuous-time system, the apparent phase velocity w/k’ 
has no dispersionless case for 0 < « < 1 analogous to equation (11). At 
low frequencies k’ ~ w while k” — const ~ w?. Thus we ignore the 
attenuation and find a zero-frequency group velocity 


(zo) = ,(0,<-) =1—e. 
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For a numerical example in the small-e approximation, let a time- 
dependent signal be introduced into a shift register at x = 0 and ex- 
tracted at station x. If z is large, the product ex is not necessarily small 
compared to unity; consider the case er = 0.2. From equation (15) 
the phase transit time through the shift register is z/v, = xk’/w = 
x + xe sin w/w. Thus the highest frequency component of the signal 
(w = 7) is transmitted in the ideal (i.e., « = 0) time zx. Lower frequencies 
take increasingly long with the w = O component arriving ze = 0.2 
stepping intervals after the « = m# component. The amplitude is at- 
tenuated by the factor e“"’* = ae °° ° which is a decreasing function 
of frequency. Thus the cut-off frequency (» = 7) is attenuated by 
an additional factor of e°** = e°* = 0.67 over the attenuation of 
the w = 0 component. 

Equation (4) is invariant under the transformation 


wok k-w 


a—I/a e— —é/a 
which can therefore be used to derive from each other the two parallel 
sets of relations developed in this section. 
If » is eliminated from equations (3) and (4), equation (25) results; 
if k is eliminated, equation (33) results. 


IV. IMPULSE REPRESENTATION 


Having established the connection with the usual basis for charac- 
terizing continuous linear systems in terms of their effect on sine 
waves, we turn to the impulse representation and investigate the solu- 
tion of a unit charge placed in station x = 0 at ¢ = 0 with all other 
stations initially uncharged. By considering the spatial distribution 
after the first few steppings as shown in Fig. 2, the solution to equation 
(2) with this boundary condition can be recognized as the binomial ex- 
pansion of (a + «)'; ie, 


.=G6,,= (‘ere 0 


= 0 otherwise 


IIA 


xst ; 
(16) 


where (!) is the widely tabulated binomial coefficient. Later we will 
regard G,,, a8 the Green’s function for more general initial or boundary 
conditions. Although our interpretation is deterministic rather than 
probabilistic, the terminology and results of probability theory will 
be quite useful. For example, equation (16) is a discrete bivariate 
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Fig. 2—Evolution of a unit charge initially in station x= 0 at ¢ = 0. A fraction 
a of the charge is advanced per step and a fraction e remains behind. 


(x and ¢ independent discrete variables) distribution, defined by a 
parabolic (partial) finite difference equation [Equation (2)]. 

In the description of practical devices one is usually interested in 
limiting expressions as ¢ and/or ¢ approach zero. Rather than writing 
several limiting forms at each point in the development, we emphasize 
once and for all an approximation which, in one variation or another, 
is frequently useful, 


ate =1—-)H=PiBOO ae! wW<l. (17) 


Although in a real device the loss per transfer may be orders of mag- 
nitude less than unity, the number of stations and hence transfers 
may be so large that t¢ is comparable to unity, and the further ap- 
proximation e ‘‘ + 1 — tis not generally accurate. 

Consider first the spatial distribution of charge with time regarded 
merely as a parameter, i.e., a horizontal row in Fig. 2. In probability 
theory it is usually assumed that a + ¢€ = 1 (probability of heads 
plus probability of tails are unity). To make direct use of extensively 
compiled properties define 


ae =a/i-t), ¢& =«/i —?%) (18) 
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so that a’ + ¢’ = 1. Then equation (16) becomes 

G28 ot( are SESH. aoe AD) 
which defines b, the binomial distribution” with normalization 


wo = Db =@' +e) =1 (20) 


and (1 — 4)‘ is just a position-independent scale factor. (The inter- 
mediate steps involved in carrying out the numerous sums which 
follow can be found in many texts on probability theory.”) 

If the speed of the pulse is defined as the speed of the center of 
mass of that charge remaining at any time, then from 


(x) = > xb, = a’t (21) 


it follows that the speed is a = (1 + </a)-, i.e., the same as the 
infinite-wavelength group velocity following equation (11). In general, 
the pulse can be characterized by its central moments » of which the 
rth 


t 
by = Dd, (« — (2))"b. , r=0,1,---.,@ (22) 
xr=0 


can be obtained explicitly as the coefficient of s*/r! in the Taylor- 
series expansion in s of the central-moment-generating function" 


t 
ere ye eee ia?) (23) 
xz=0 


Thus in particular the variance of the pulse, a useful measure of its 
spread, is pe = ta’ && = t(2 + a/e + €/a) where the last form again 
explicitly exhibits the fact that normalization-independent properties 
depend on a and e only through their ratio. Higher central moments 


are conveniently obtained from Romanovsky’s recursion relation.171® 


Mra = oe’ (trp, + du,/do’). (24) 

Some further useful relations are as follows: When the pulse spreads 

excessively for digital applications, one alternative is to utilize 

2p + 1 consecutive stations for one pulse. It then follows directly 

from the parallel-axis theorem for the moment of inertia that the 

variance is increased by (only) the additive term p(p + 1)/3 over 
the previous result. 
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At time ¢ the ratio of the charge in the next-to-leading station to 
that in the leading station of the pulse is te/a. Thus the leading station 
of the pulse is also the most heavily charged until time t = a/e when 
the peak starts to drop back from the lead. At all times the charge 
distribution falls off monotonically from the peak. At any time the 
fraction of the remaining charge confined to the leading station is 
(1 + ¢/a)~*. 

The results in Section III can be derived as consequences of those 
of Section IV, and conversely. For example the single-spatial-Fourier- 
transform solution of equation (2) can be obtained via the binomial 
distribution regarded as a Green’s function (in the terminology of 
mathematical physics). Thus the initial sinusoidal spatial charge dis- 
tribution G20 = Im e~** evolves into qz at time t where 


t 
Qz,t = Im > Gace Vr 
a oe _ (25) 
= Im (e + ae")'e ™ = Im (a + ee ™, 


Equation (25) is equivalent to equations (3) and (4) with o elimi- 
nated. Equation (6) results if the real and imaginary parts of equation 
(25) are extracted. Apart from the normalization, the evolution factor 
which carries the initial spatial distribution e~*” into the later distribu- 
tion can be recognized as (¢ + a’ e**)* the characteristic function 
(Fourier transform) of the binomial distribution in probability 
theory.’ In the last form of equation (25), (a + ce*)* evolves the 
unperturbed or ideal (e = 0, a = 1) solution e*(**) into the actual 
solution. Finally (1 + («/a)e“*)* (— exp(e“*te/a) as </a> 0) evolves 
the « = 0 solution a‘e**(-), 

As before [cf. equation (7)] we seek an approximation of the Green’s 
function in the limit that e/a, but not necessarily te/a, is small compared 
to unity. If ¢ is also small, repeated use of equation (17) yields 


G.,.7e€ ‘p(ti-—2), €¢f—-0, to, te, t = const. 
het, t=tt—-1l-- (26) 


where p(u) = e>r“/u! (uw = 0, 1, -::) is the Poisson distribution.2” 
Sum rules similar to those illustrated for the binomial distribution 
can be carried out yielding in particular a pulse speed (x)/f = 1 — 
A/t = 1 — e and a variance A = te. As long as the pulse peaks near 
or at the leading station, the Poisson distribution is usefully accurate 
for the more strongly charged stations even when ¢ is only modestly 
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large. A comparison of the binomial distribution and its Poisson ap- 
proximation are given in Table I. 

It is impractical to summarize the very many relevant exact and 
approximate properties of the binomial distribution here, particularly 
since a thorough compilation is already available.” We have, however, 
given a few of the most important properties to illustrate the useful- 
ness of this connection with the extensive literature of probability 
theory. 

Turning now to a shift-register interpretation of equation (16), we 
regard x as a parameter (the number of the stations which are used as 
the register) and study the time-dependence of the charge in station 
x — 1, ie., a vertical column in Fig. 2. The development proceeds in 
close analogy to that of the binomial distribution and most of the 
motivating remarks need not be repeated. 


TaspLe I—A ComPARISON OF THE BINOMIAL AND NEGATIVE BINOMIAL 
DISTRIBUTIONS WITH THEIR PoIssOoN APPROXIMATION 


1 2 3 4 5 6 
x T 
9 0 0 0 0 —1 
8 0 0.430 0.411 0 .430 0 
7 0 0.383 0.365 0.344 1 
6 0 0.149 0.162 0.155 2 
5 0 0.033 0.048 0.052 3 
4 0 0.0046 0.011 0.014 4 
3 0 4.1 X 1074 1.9 X 1073 3.4 & 1073 5 
2 0 2.3 X 1075 2.8 X 1074 7.4 X 10-¢ 6 
1 0 7.2 X 1077 3.6 X 1075 1.5 X 1074 7 
0 1 1 xX 10° 4.0 X 10-8 2.8 X 1075 8 
—1 0 0 3.9 X 1077 4.9 X 107% 9 
~2 0 0 3.5 X 107% 8.4 XK 1077 10 
—3 0 0 a — 11 


For the case a = 0.9 (fraction of charge successfully advanced per step), « = 0.1 
(fraction which remains in its station per step), and ¢ = 0 (fraction of. charge lost 
per step). 


Column 1: Station number x for columns 2, 3, and 4. 

Column 2: Initial distribution of charge among the stations. 

Column 3: Binomial distribution of charge among stations eight steps later [from 
equation (16) and (19)]. 

Column 6: Detection time 7 for Columns 5 and 4 as measured from time of initial 
detection, 7 = 0. 

Column 5: a times the negative binomial distribution of charge nondestructively 
observed in station 7 (a times equation (27)) as a result of initial spatial 
distribution of Column 2 = distribution of charge observed by a charge- 
removing detector in station 8 (cf. Appendix). 

Catuink 4: Poisson approximation to Columns 3 and 5 [from equation (26) or (34)]. 
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It is notationally convenient to use a new time 7 which starts at 
time € = x — 1 when some of the charge placed initially in station 
zero is first observed in station x —1,i.e.,7 = t — x + 1. Then equation 
(16) becomes 


G.-1,. = 4 ve i. Da, 7=0,1,°-:, 0. (27) 


If at is defined as 1 — « (so that at + « = 1), equation (27) becomes 


dante siprvee(e)e es 


and defines n,; the negative binomial distribution?” with normalization 





Mo = Don, =a(l—o)*=1, (29) 
7T=0 


Since charge which fails to advance is observed again, the sum of all 
charge (nondestructively) observed in station z — 1, ie., a "(1 — ©)’, 
may exceed unity even if £ = O (cf. the discussion of boundary con- 
ditions in the appendix). Whereas all normalization-independent 
properties of the binomial distribution depend on a@ and e only through 
their ratio «/a, here all normalization-independent properties depend 
only on e (cf. Figs. 1 and 2). 

The charge in station x — 1 is observed at a mean time (t) = x — 
1 + (7) where 


(7) = > ™, = re(1 — 6) (30) 


and thus in the shift-register context the pulse speed can be defined 
as (x — 1)/(t) which approaches 1 — e for large x; i.e., the same as 
the zero-frequency group velocity following equation (15) but faster 
than the definition following equations (11) and (21) except in the 
case of no charge loss (¢ = 0) when all definitions agree. 

The rth central moment of the charge sequence in station x — 1 is 


b= De Wn, (31) 


which can be obtained by direct calculation or as the coefficient of 
s'/r! in the Taylor expansion in powers of s of the central-moment 
generating function?’ 
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et) etn =e OU = et — 2. (32) 
7=0 


In particular, the variance is pw, = ve (1 — <)>. 

The first charge observed in station x — 1 is a fraction (1 — «)? of 
all that will be observed in that station. It is also the largest charge 
observed in x — 1 if xe, the ratio of the second amount observed to 
the first, is less than unity. Thus the input, ze, for the example at the 
end of Section III can be obtained experimentally from the ratio of 
the first two nonzero charges present in x — 1 due to a single initial 
charge in x =.0. More accurate graphical methods actually developed 
for statistical contexts can be directly applied here to infer the param- 
eters from the final distribution resulting from an initial pulse.*” 

According to equation (2) a sinusoidal signal go. = Im e* present 
in station zero at time ¢ adds an additional charge Aqi,441 to station 1 
at ¢ + 1 where Agi,441 = aQo,z (cf. the Appendix). Thus from equation 
(28) the charge present in station x at time ¢ is 


c) 
Qe.t es Im G bie Te 
7T=0 


x x 
Qa i a : = 
Im ( = Yor = Im ( =) ge x) 
e° —e 1 — ee 


which is equivalent to equations (3) and (4) with k eliminated. Equa- 
tion (18) results if the real and imaginary parts of equation (33) are 
extracted. Apart from the normalization the transfer factor which 
multiplies the initial signal e”* in equation (83) is the characteristic 
function of the negative binomial] distribution. 


(33) 





In the limit of small ¢« and ¢, but not necessarily small ex or ef, G,,, 
becomes exp (—fz) times the Poisson distribution; 1.e., 


G,, 76 “e“A/r!, €,€>0;x— ~; fz, ex = const A = ze (34) 


which is functionally equivalent to equation (26). Thus as illustrated 
in Table I, the distinction between the spatial and temporal projec- 
tions of the impulse solution disappears in the Poisson limit [cf. equa- 
tions (7) and (14) ]. 


V. SUMMARY 


Without reviewing any one of the three specialized fields individ- 
ually, we have emphasized by way of an explicit case of current in- 
terest the connection between, on the one hand, the present practice 
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of characterizing discrete charge-transfer devices in terms of their 
charge-transfer efficiencies and, on the other hand, the two well- 
developed fields of (analog) sinusoidal signal analysis and probability 
theory (interpreted deterministically). 

Parenthetically we note that the model of this paper [equation (2) ] 
is of some tutorial interest in that it permits the basic ideas of a 
traveling-wave description of discrete-space-and-time linear systems 
(including therefore as special cases the limits of continuous space 
and time) to be exemplified immediately in a unified manner via 
a simple device. The usual textbook vehicles such as sampled-data 
control systems, lattices, or transmission lines suffer tutorially in 
varying degrees from being too restricted (discrete in only space or 
time; no attenuation) and requiring a lengthy and specialized physical 
explanation of the origin of the basic equation in a less easily 
visualized system. 
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APPENDIX 


In the main portion of this paper the boundary conditions were 
deemphasized. For example, in shift-register operation the charge 
distribution was assumed to be unaffected by its own detection. Let 
dz. be the nondestructively observed charge as distinguished from 
qi, , the charge measured by a destruction process that removes all 
of the charge from station x. Then by equation (2) these two limiting 
cases are simply related as follows 

Qo.2 = CGe-1,2-1 5 (35) 
ie., the expressions given for a nondestructively observed shift- 
register need only be multiplied by a to obtain their destructively 
measured counterparts in a register with one more station. In the 
case of a detection process which removes a fixed fraction of the 
charge, the signal can be obtained from the difference between suc- 
cessive measurements. 

Similarly the two limiting cases at the input station of the register 
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are Dirichlet (voltage) and Neumann (charge) boundary conditions. 
In the Neumann case a fixed amount of charge is added to whatever 
charge may already be in the station. In the Dirichlet case a fixed 
applied voltage adds charge as necessary to what is already in the 
station thereby achieving a total charge predetermined by the voltage 
level. By equation (2), and in analogy to equation (35), these two 
cases are simply related because a fixed total charge q, in x at ¢t 
injects a fixed additional charge aq,, into x + 1 at ¢ + 1 (cf. equa- 
tion (33)). 

To generalize equation (2) consider a constant inhomogeneous term 
gi which might, for example, represent the amount of dark-current 
charge added per station per stepping interval. Then (2) becomes 


Qe: = OQz-1,1-1 + €Gz,t-1 + Qa - (36) 
An important particular solution of equation (36) is the spatially- 


uniform rising-in-time solution which starts from a spatially uniform 
distribution go at t = 0: 


= Bq te = as, B=ate, t=0,1,--- (37) 


= qo + qat, £= 0. 
This could describe a recirculating memory where the output of ‘he 
last station is fed back into the initial station. The constant-in-time 
rising-in-space solution is 











ee LS 
i= VOT, .7— © a, Y=7->: C= 0; bees 
(38) 


=o+7—-&, f=0, 


where qo is now the charge in the (initial) station x = 0. This steady- 
state distribution could be maintained in a shift register by removing 
a charge da — (1 — «) qo from the initial station per step. Equation (2) 
is now understood to describe a-signal (homogeneous solution) imposed 
on some particular solution to equation (36) describing the back- 
ground charge. 

Next assume, as is certainly true in some devices, that the transfer 
fractions a and e depend upon at least some of the q’s; 1.e., the process 
is nonlinear. Then in the small-signal case (whether or not qq is fur- 
ther generalized to be a function of x and t) @ and e in equation (2) 
would still be independent of the magnitude, of the signal charge but 
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no longer independent of x and t even though all stations were phys- 
ically equivalent apart from their momentary charges. In the case of 
equation (37) a and « would still be « independent; in the case of 
equation (38) still ¢ independent?® (cf. nonuniform transmission lines). 
Before proceeding we switch to the suitably developed terminology 
of probability theory (without necessarily implying a probabilistic 
interpretation). Actually, this has some plausible justification. To the 
extent that equation (2) applies independently to each of the indi- 
visible electrons or holes in the charge, a and « must be interpreted as 
transition probabilities, and our deterministic interpretation results 
. only because the large number of electrons or holes permits fluctuations 
to be ignored. In this probabilistic sense one can define the information 
content of a discrete charge distribution.?° Similarly in equation (86), 
qa(z, t) could be a random variable describing the introduction of 
extraneous noise. | | 
To complete the small-signal or linear model for the signal we note 
that in practice the stations are driven by n-phase time-dependent 
voltages such that in a coordinate system y which moves at the ideal 
signal speed (y = x — t) the voltage distribution among the stations 
appears constant in discrete time and periodic in discrete space with 
a period of n stations. In each cycle of n stations one station is the 
designated potential well which, in the ideal case, carries all of the 
charge in that spatial cycle. The distortion of the signal corresponds 
to the transitions which the electrons or holes make to neighboring 
stations and, eventually, to neighboring wells. In a causal system the 
homogeneous equation for the signal charge takes the form 


t-1 Ce) 
Qt = » Dy Lia tarda ae ; (39) 


th=—0 y’=—0 ; 
where the transition elements of the JT matrix are g-independent in a 
small-signal theory. (In some cases, the upper limit of % is ¢ rather 
than ¢ — 1.) 

If the system is memoryless such that the spatial charge distribution 
at one time is sufficient to determine the distribution at the next time 
and hence for all time, then equation (39) reduces to the Markoffian 
form 


Qt = oD Do (espa r (40) 


This simplification corresponds, for example, to neglecting traps which 
accept signal charge at a rate proportional to the local signal and then 
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emit the charges into the signal over a time scale long compared to 
the stepping interval. 


When the background charge distribution, if any, is time independent, 
Ty.y't,.7 depends on ¢ and ¢’ only through their difference. If the back- 
ground distribution is spatially constant, T is a periodic function of 
y — y’;i.e., T describes a random-walk” problem on a one-dimensional 
lattice with anisotropic and spatially periodic transition probabilities 
(cf. Floquet’s theorem”’). 

Finally, of course, large signals under nonlinear conditions are of 
interest. However, it is not yet clear that there is any dependence of T 
on the q’s which is more general than the specific device where it 
arises. 

In the moving coordinate system equation (2) becomes 


Qu,t = ADy,t-1 + €Qy+1,t-1 (41) 


which is functionally equivalent to equation (2) and distinct only in 
that the roles of a and «, as well as the sense of spatial direction, are 
reversed. If the transition probabilities (7 matrix elements) are small 
out of the wells and large out of the other stations between wells, then 
‘this 1-phase equation can be used to approximate an n-phase system. 
The unit of time is taken as the effective stepping interval equal to n 
actual stepping intervals, and successive integer values of x or y label 
the n-cycle groups or the wells. The matrix elements « and @ in equa- 
tion (41) then describe a net transfer of charge between, or retention of 
charge by, the n-cycle moving groups during one (effective) time unit. 
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Micromachining and Image Recording on 


Thin Films by Laser Beams 


By D. MAYDAN 
(Manuscript received March 22, 1971) 


A theoretical and experimental description of laser machining of 
thin metallic films vs gwen. Calculations are carried out for the tem- 
perature rise of a thin film in response to a pulse of incident light 
energy and for the dependence of the temperature rise on the different 
thermal constants of the substrate and film and on the illumination 
conditions. 

Experiments have been conducted on the pulsed machining of thin 
bismuth films with glass and mylar as substrates using a lowest- 
order transverse mode argon laser with an average power output 
capability of 20 milliwatts. A fast intracavity acoustooptical modula- 
tion system produced optical pulses with durations controllable from 
25 ns to several ms. The pulse repetition rate could be varied from a 
single pulse up to several MHz. The peak power depended on the duty 
cycle but was limited to 2 watts for low duty cycles. Experiments were 
done with both front and back illumination. (In back illumination the 
laser beam ts incident on the film through the substrate.) For optimal 
machining conditions in which the optical beam diameter is adjusted 
to produce the maximum diameter of transparent area for a given 
pulse energy, less than 50 percent of the removed materval left the 
surface of the substrate. The remainder was displaced so as to leave 
some areas free of bismuth while the thickness of bismuth in other 
areas was increased. With a pulse duration of 25 ns and a 600-A-thick 
bismuth film on mylar, the peak power required to machine a 6-nm- 
diameter spot was about 0.7 watt. 

Variable amplitude light pulses produced by the intracavity modu- 
lation system with a 1-MHz repetition rate and duration of 25 ns were 
used to write images on a 600-A-thick bismuth film, deposited on a 
mylar substrate, by deflecting the laser beam in raster fashion over the 
surface of the bismuth film. The average laser power output was 20 
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mW. The film area was 8 X 10 mm2. Both positive and negative 
continuous-tone images were recorded. The images consisted of an 
array of 1200 by 2000 completely independent spots of varying diam- 
eter and a spot density of 4 X 10°/cm?. Images of documents which 
were originally of size 84 X 11 inches showed a limiting resolution of 
over 175 lines per inch when magnified to the original size. 


I. INTRODUCTION 


Images can be recorded on an opaque metallic film on a transparent 
substrate by using an amplitude-modulated laser beam to machine 
the metallic film. The modulated laser beam is scanned in raster 
fashion over the surface of the film, and the thermal energy dissipated 
in the film causes a local displacement or removal of the metallic 
material. In this way a permanent image suitable for immediate dis- 
play or storage is created. 

Experiments along this line have been reported!? but reveal a 
number of shortcomings from the standpoint of practical application. 
Generally, there is no gray scale, only negative line images are repro- 
duced, resolution is limited, and the required laser power is large. 

A new laser machining technique is described in this paper which 
overcomes these shortcomings and permits the generation of high- 
quality, continuous-tone, permanent images using low-power gas 
lasers. The significant departure from previous work rests in the use 
of very short laser pulses to record the image in the form of an array 
of discrete holes in the metal film and in the modulation of the in- 
tensity of the laser pulses so as to vary the size of the holes. Each 
laser pulse serves to machine a single, nearly circular hole in the metal 
film by displacing and removing metal from the transparent substrate. 
In this way a transparency is created, and incandescent light can be 
directed through the film to display the image on a screen. The de- 
livery of the laser energy to the metal film in short (20-30 nanosecond) 
pulses reduces the energy density required to raise the metal film to 
a given temperature by at least an order of magnitude over that which 
would be required for the same writing speed if a CW laser were used. 
Consequently, the average laser power required to displace or remove 
a given amount of metal per unit time from the transparent substrate 
is correspondingly reduced. In addition, the area of the transparent 
spot machined in the film varies in a nearly linear fashion with the 
pulse height, so that a good gray scale results. Equally good positive 
or negative images are produced, and successive, isolated machined 
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spots have a transmission factor independent of adjacent spots or 
position along the scan line. The spots were machined at the rate of 10° 
spots/second, and the frames consisted of about 2000 lines of spots 
with about 1200 spots in each line. 

The short laser pulses were generated with an argon ion laser in 
conjunction with an intracavity acoustooptic modulator**. The laser 
had an average power capability of 20 mW. The intracavity modulator 
served to deflect very short pulses from the laser cavity with much 
higher peak power but with an average power of approximately 
20 mW. 

In what follows, a model of the machining process will be developed 

based on electron micrographs of the machined spot. A calculation of 
the temperature rise of the film as a function of time is described and 
used to predict the laser power required to produce threshold ma- 
chining for pulses of a given length. 


II. DISCUSSION 


2.1 Temperature Rise of Thin Films Due to Laser Radiation 


A laser beam incident on a thin, optically absorbing film will cause 
the film temperature to rise.°-7 Assuming a laser power density P, 
transmitted through the film surface, the light intensity absorbed per 
unit thickness of the film at depth x may be described by 


d —2z Ps -2z 
Pkt) a da (Poe ”) = rag i (1) 


where 6 is the skin depth of the film, that is, the depth at which the 
light intensity drops by a factor e from its initial value. If the laser 
beam diameter is very large compared with the film thickness, the 
heat flow may be treated as a one-dimensional problem. It will be 
assumed that the thermal and optical* properties of the absorbing ma- 
terial are independent of its temperature and phase.t Under these con- 
ditions and assuming no heat loss by radiation, the temperature rise of 
the thin film and the substrate is described by the differential equa- 


- * The optical properties of thin bismuth films were measured and no significant 
change was detected below the vaporization temperature. The measurements were 
carried out by monitoring the amount of transmitted and reflected light from 
the bismuth film during a laser machining process. 

-+ While this assumption clearly limits the accuracy of the calculations, the 
experimental results presented later show reasonable agreement with the calcu- 
lations (See Fig. 14, for instance) and hence the error involved may not be 
very great. 
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tions of heat conduction:® 


aT, = a aT, = oe —2/6 

ds =k, Of ~~ OK, 8 (2) 
aT, 1 OT, _ 

a i Ok (3) 


where 7’; is the film temperature, TJ. is the substrate temperature 
measured relative to the ambient temperature, k; and ke are the ther- 
mal diffusivity of the film and substrate respectively, (k = K/pc), p 
is the density, c is the specific heat, K, and Kz are the thermal con- 
ductivity of the film and substrate respectively, and ¢ is time. The 
initial conditions, corresponding to the onset of illumination, are: 


at ¢ = 0, fi = Ts = 0. 
The boundary conditions are: 
at ¢>0 and « =0, 


ar 
Ox 


at ¢>0O and xwx=d 
(where d is the thickness of the metal film), 





T; = T2 
and 
of, ots 
Ky dx Ky Ox 


(equal temperature and heat fluxes at film-substrate boundary). The 
substrate is assumed to be infinitely thick, so that Tz = Oatx= 0. 
The temperature at depth x within the thin film can then be shown 
to be approximately equal to 


co 
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in which erfc is the complementary error function and ierfe is its in- 
tegral over the argument between limits x and o. 


A-1 
re a (5) 


r= ( ky 


All the terms in equation (4) which have +z should be repeated once 
with +a and a second time with —z. The first term for example is 
equal to: 


= — nai) 2n+id+e,. 2n+ ld—x 
aX IV kit a jienf 2~/tl + ierfe ne \ 

For the case of back illumination, where the laser beam is incident 
on a transparent substrate, the differential equations are the same as 
equations (2) and (3) with x replaced by d — z in the exponential 
term. The boundary and initial conditions are the same as in the pre- 
vious case. The temperature of the thin film in this case is approxi- 
mately equal to 


ee ee Oe ae ee (2u + Dd + 2) 
Tiw)bs K, > IV kite a’””* ierfe a0 or ae 


(Qn+ d+ *) 
2V yt 








Aa” 
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— IV kit e ” ierfe (a) 


a” (2n + ld + *) = cee, 
ies A erte ( Ei de (6) 


The model used above refers only to illumination by a step function 
of optical intensity. For comparison with experimental pulses having 
finite rise and fall times, a correction factor depending on the time 
dependence of the incident light pulse should be included in equations 
(4) and (6). For most practical cases in which d > 6 all the terms in 
equations (4) and (6) which contain the expression e’”’ are smaller 
than the other terms and can be ignored. In addition, for the cases 
in which the illumination duration ¢ is 10 ns or longer, Wki > 6 so 
that all other terms with 6 can be ignored. Numerical results based 
on equations (4) and (6) were obtained for the temperature distribution 
across the film and for the temperature dependence on the time from 








id 


1766 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


the onset of illumination. Of a number of metallic films examined, 
bismuth films required the least energy to machine a given size of 
spot at a given optical density. For this reason, the results presented 
herein all relate to bismuth films on glass and mylar substrates. 

The thermal constants of bismuth, glass, and mylar are given in 
Table I. The light. absorption in bismuth was measured, and 6 was 
found to be ~140 A for the 4880-A wavelength of the argon ion laser. 
The calculated temperature rise of the bismuth film as a function of 
the light pulse duration, with the film thickness as a parameter, is 
shown in Figs. 1, 2, and 3. Figures 1 and 2 illustrate the case of front 
illumination with mylar and glass substrates, respectively. The results 
in Figs. 1 and 3 are given for bismuth film thicknesses of 330, 670, 
and 1000 A and in Fig. 2 for film thicknesses of 330 and 1000 A. The 
temperature rise is shown both at the bismuth-substrate boundary 
(« = d) and the bismuth-air (cx = 0) surface. Figure 3, which refers 
to a bismuth film on mylar with the laser beam incident onto the 
mylar substrate, is seen to be essentially identical to Fig. 1. For long 
times, the temperature is proportional to ~/t (slope equal to that of 
the broken line on the figures). This same proportionality is obtained 
for the surface temperature of a semi-infinite solid with constant flux 
of heat at the surface.” 

As seen in Figs. 1, 2, and 3, for short times (<50 ns), the tempera- 
ture rise approaches a linear dependence on time such as one might 
expect if the thermal loss to the substrate were small compared with 
the absorption of thermal energy by the thin metallic film. This is 
especially so with the mylar substrate due to the lower thermal con- 
ductivity of the mylar as compared to glass. For the same reason, the 
temperature rise of the bismuth film on glass is smaller than that of 
the same film on mylar. 

The temperature distribution through the thickness of the film in 
the case of a 1000-A-thick bismuth film on mylar is shown in detail in 


TaBLE I—THERMAL CONSTANTS 














Conductivity 
[cal/(s)(em?) Diffusivity 
Substrate (°C/em)] [em?/s] a A 
Bismuth 2X 10° 7X 10 
Glass 28 xX 10-4 58 X 1074 
Mylar 4 x 10-4 1 xls 
Bismuth on glass 0.35 2.06 
Bismuth on mylar 0.72 6.30 
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Fig. 1—Temperature rise of a bismuth film on a mylar substrate caused by 
front illumination. 
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Fig. 2—Temperature rise of a bismuth film on a glass substrate caused by 
front illumination. 
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Fig. 3—Temperature rise of a bismuth film on a mylar substrate caused by 
back illumination. 


Fig. 4a and b. The temperature distribution is given for the case of 
front illumination at the end of 10 ns (Fig. 4a) and at the end of 100 
ns (Fig. 4b). The temperature gradient decreases with longer illumina- 
tion, which can also be seen from the previous figures. 

The temperature gradient across a 670-A bismuth film on mylar, 
illuminated from the back, is shown in Fig. 5. Results are shown after 
10 ns (Fig. 5a), after 100 ns (Fig. 5b), and after 1 us (Fig. 5c). As 
expected, for short illumination duration, the temperature is higher 
near the substrate interface, where most of the laser intensity is ab- 
sorbed. Because of loss of heat to the substrate, the location of maxi- 
mum temperature shifts in time towards x = 0, and the temperature 
gradient decreases (Fig. 5c). 

The temperature gradient across the film thickness is always very 
small (about 2 percent of surface temperature for the case described 
in Fig. 4b). For this reason, the model used for the foregoing calcula- 
tions, which assumes that a single phase of the film is in existence at 
any instant, is adequate up to the vaporization temperature. 


2.2 Optimal Machining Considerations 


It is assumed that the laser beam power density absorbed by the 
thin film has an azimuthally symmetric Gaussian shape of the form 
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T(z, r) = I,(2)e °"* (7) 


in which r is the radial distance from the axis, I(z, 7) is the laser in- 
tensity at a distance 2 from the focal plane along the optical axis, 
I,(z) is the laser intensity at r = 0, and w(z) is the radius of the waist 
at the 1/e points of the intensity. For a beam propagating in the z 
direction, H. Kogelnik and T, Li?® have shown that 


way Sas (#5) 

(we he Qnwe (8) 
where A is the light wavelength and w is the waist radius at the focal 
plane, w, = w(o). Assuming a laser beam with a specified total power 


P, one can write 


pe / Ee oa Sea (9) 
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Fig. 4—Temperature distribution across 1000 A of a bismuth film on mylar: 
(a) after 10 ns of front illumination; (b) after 100 ns of front illumination. 
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Fig. 5—Temperature distribution across 670 A of a bismuth film on mylar 
after 10 ns (a), 100 ns (b), and 1 ws (c) of back illumination. 


which defines 


1 P 
I,(2) 3 = wz)” (10) 
Machining or evaporation of the thin films with a spot diameter D 
and a given pulse duration requires a certain threshold intensity Ip = 
I(z, D/2). At the focal plane (2 = 0), it follows from equation (7) 
that 


ecod, oat 
Tx = ae (D/2w o) (11) 
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or 


PHalwee?’ 

where D is the machined spot diameter. By taking the derivative of P 
with respect to w, and setting dP/dw, = 0, it can be shown that the 
laser power required to machine a spot of given diameter is minimum 
when 


— 
ro (12) 
It follows that the optimal beam waist diameter machining a spot of 
diameter D is 


2Woot = D. (13) 


As an example, consider the machining of a 1000-A bismuth film 
with a mylar substrate. The laser power density absorbed in the film 
is assumed to have a Gaussian shape with a peak intensity I, equal 
to 3 x 10° watts/cm?. The melting temperature of bismuth is about 
271°C. From Fig. 3 it is found that starting from room temperature 
the thin bismuth film will reach the melting temperature after 16 ns of 
illumination time. The latent heat of fusion of the bismuth is L = 10.2 
cal/g, and the specific heat is C = 0.03 cal/g°C. The additional time 
At used in supplying the latent heat of fusion is estimated from the 
time required to raise the temperature by the amount AT = L/C = 
300°C. From Fig. 3 this time is found to be At & 20 ns. Following 
that, the temperature will continue to rise toward the evaporation tem- 
perature. The time required to reach the evaporation temperature after 
supplying the latent heat of fusion is found from Fig. 3 to be equal to 
124 ns. Assuming that machining is done under optimal conditions, as 
previously described, the temperature rise was calculated at different 
points across the diameter of the Gaussian beam. Results are shown 
in Fig. 6 for the temperature of the bismuth film after 160 ns illumina- 
tion duration with a laser peak intensity of 3 x 10° watts/em?. The 
four lines shown correspond to the temperature at a = 0 (the center of 
the machined spot, at a = 0.25D, at a = 0.50D (the edge of the ma- 
chined spot), and at a = 0.67D, which is the radius where the film 
just reached the melting point. 

When the laser beam is focused to a radius different from wopt , the 
machined spot diameter is reduced. The ratio between the diameter of 
the machined spot and the optical beam diameter is found from equa- 
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Fig. 6—Temperature distribution across the machine spot diameter. 


tions (7), (10), and (12) to be: 


D(z) 
2w(z) 


Suppose a beam is focused to a waist w,,, which is optimal for machin- 
ing a spot of diameter D.,. = 2w.»: . Let the depth of field be the 
distance | from the focal plane for which the machining spot diameter 
is reduced by a factor of V2. It is found from equations (8) and (14) 
that 





= V1 — 2 In (w(e)/woy)- (14) 


(15) 


As an example, to machine a spot diameter of 10.m, with an argon ion 
laser which has a wavelength of 0.488 A, the field depth according to 
(15) is | 


aaa 


2 
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III. EXPERIMENTAL RESULTS 


Machining experiments have been conducted on thin bismuth films 
using glass and mylar as substrate materials. These films were pre- 
pared by vacuum evaporation techniques in thickness of 200, 400, 500, 
600, 800, and 1000 A respectively. A folded-cavity, CW argon laser was 
used in conjunction with a fast acoustooptic modulator*® to provide 
light pulses whose length could be varied from 25 ns to several ms. The 
repetition rate of the light pulses could be controlled and varied from 
a single pulse up to several million pulses/second. The focused spot 
could be swept in one dimension using deflectors, such as galvanom- 
eters,! rotating mirrors, etc. However, for most of the data reported 
in this section, the focused spot was kept fixed, and the film was moved 
in a direction perpendicular to the optical axis of the light pulses, as 
shown in Fig. 7. Tilting the film at a small angle a with respect to its 
direction of travel (shown in Fig. 7) causes the bismuth film to pass 
from inside the plane of focus to outside in its travel past the focused 
spot. With sufficient power in the light pulse a line of circular spots 
was machined. 

‘Figure 8 shows four pairs of lines machined with four different light 
intensities. The spot diameters along these lines exhibit two maxima 
corresponding to the optimum case described in Section II [equations 
(12) and (13) ]. The two maxima are at equal distances from the spot 
machined when the film is at the beam waist. The pulse repetition rate 
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Fig. 7—Experimental setup. 
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Fig. 8—Four sets of lines machined with different intensities. 


and film speed were adjusted to produce nonoverlapping but nearly con- 
tiguous spots along each line; i.e., each spot was formed by a single op- 
tical pulse. The film in this case was a bismuth film of 1000 A thickness 
on a glass substrate. The optical beam was focused to a diameter 2w, of 
about 4um. The light pulse duration was 25 ns with peak intensities of 
1.6, 0.8, 0.5, and 0.25 watts for the first, second, third, and fourth pairs 
of lines, respectively. The maximum spot diameter (which occurs at 
the brightest part of each line in Fig. 8) and the separation between 
the two maxima in each individual line increase with the intensity of 
the incident light. The machining obtained when the beam was in focus: 
on the bismuth film is marked by the vertical line in the center. The 
disappearance of machining at the two ends of each line occurs where 
the peak power at the center of the beam falls below the machining 
threshold power I7. . 

A more detailed picture of some individual spots machined with the 
same laser intensity is given in Fig. 9. These photographs were ob- 
tained from a scanning electron microscope at 10,000 and 2000 times 
magnification. The approximate locations of these spots are indicated 
by the letters a, b, c, d, and e in Fig. 8. All spots shown in Fig. 9 are 
located on the top line of machined spots in Fig. 8. The individual 
machined spot magnified by 10,000 times in Fig. 9 is also marked by a 
small arrow on each 2000 magnification photograph. The machined 
spot obtained when the laser beam was focused in the plane of the bis- 
muth film is shown in (a) of Fig. 9. Machining obtained with a power 
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Fig. 9—Photographs of individual machining spots obtained from a scanning 
electron microscope. 
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just above threshold is shown in (d). The largest machined spot ob- 
tained is shown in (b). As clearly seen from these photographs, very 
little, if any, of the bismuth film was evaporated at the lower power 
densities, as shown in (c) and (d). Near the threshold power, the metal 
was only melted and the surface tension pulled the metal back from the 
center forming a crater, as shown in (c). In the machined spot at the 
center of (c), surface tension caused a large percentage of the liquid 
metal to resolidify into a sphere within the crater. In (e) the laser 
power was just below the threshold level, and while the metal appears 
to have been melted at each spot, only in a few cases were craters 
formed. 

In (a) and (b) of Fig. 9 the optical beam intensity at the center of 
the spot is greater, and it is likely that some evaporation of material 
occurred in this region. The metal vapor pressure has forced some of 
the liquid metal away from the substrate, and surface tension has 
caused the remainder of the liquid bismuth to resolidify in the irregu- 
lar form evident in (a) and (b). By estimating the volume of material 
accumulated around the crater in (b), the amount of material evapo- 
rated for the optimum machining is found to be less than 50 percent of 
the total metal displaced from the crater. 

In order to measure the machining field depth, a similar experiment 
was carried out. Argon laser pulses with 25 ns duration and peak power 
of 0.75 watt were focused to a minimum diameter of 2um. The bismuth 
sample was again moved perpendicular to the optical axis and was 
tilted at a small angle a with respect to its direction of travel. Assum- 
ing that the beam has a Gaussian shape, the optical beam diameter 
could be calculated for each machined spot. The relation between the 
machined spot diameter and the light beam diameter is given in Fig. 
10. The results are given for an 800-A-thick bismuth film on a glass 
substrate. The solid line is the theoretical line given by equation (14). 
As seen in this figure, the observed machining field depth is extended 
beyond that of the theoretical curve, especially at the lower power 
densities. This may result from the surface tension effect near the 
threshold power. 

Figure 11 shows the effect of varying the laser intensity. A 400-A- 
thick bismuth film on a mylar substrate was used in this case. The 
optical beam was in focus all along the machined line, and the machin- 
ing was obtained by back illumination. As indicated on this figure, the 
beam intensity was varied from 100 percent to 24 percent, the 100-per- 
cent intensity corresponding to a light pulse of 1 watt peak intensity 
with 25 ns duration. The machining spots varied in diameter according 
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Fig. 10—Relation between machining spot and light beam diameters. The 
notation is that used in equations (13) and (14). 


to the incident light intensity, from about 8 to lum. The optical beam 
diameter was 2 w, ~ 6um. As shown in Fig. 12, above threshold the 
measured machining area varied linearly with the light beam intensity 
for these particular machining conditions. 

The variation of threshold machining power density with pulse dura- 
tion was measured for pulse durations in the range from 25 to 4000 ns. 
The light pulse shapes for several durations are shown in Fig. 138. The 
results obtained with a 600-A-thick bismuth film on mylar (illumi- 
nated from the substrate side) are given in Fig. 14. The solid line is 
the theoretical curve based on the results given in Section II. The small 
circles are the experimental results. Both the theoretical and the ex- 
perimental results in Fig. 14 have been normalized so that they coin- 
cide at the 1-ys illumination period. The small deviations from the 
theoretical results for the short durations are not yet explained. 

The peak pulse power required to machine 6-m-diameter spots on 
500-A bismuth films on mylar with 25 ns pulse duration was found to 
be of the order of 0.7 watt. This corresponds to an average energy den- 
sity of 0.06 joule/cm? over the area of the spot. As theory predicted 
(Figs. 1 and 8), the same power requirements were found for both back 
and front illumination. 

Figure 15 shows the dependence of the energy density, required to 
machine 6-~m-diameter spots with laser pulses of 25 ns duration, on 
the film thickness. The energy density plotted in Fig. 15 is the peak 
pulse power times the pulse duration divided by the spot area. For 
optimal machining, this is also equal to the peak power density I, at 
the center of the beam times the pulse duration [from equations (10) 
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Fig. 11—Machining spots obtained with different light intensities. (The 100- 
percent value corresponds to a peak intensity of 1 watt with 25 ns pulse duration.) 


IMAGE RECORDING BY LASER BEAMS 1779 


60 
50 
< 
se 
cnr 
<zZ 40 
2 
o> 
ut 
< 30 
OF 
20 
Za 
rat 20 
LZ 
= 
10 
10) 
0 20 40 60 80 100 


INTENSITY IN ARBITRARY UNITS 
Fig. 12—Relation between machining spot area and illumination intensity. 


and (13), I, = P/(7D?/4)]. The films were on mylar substrates and 
the machining was from the substrate side. As expected from the 
theoretical results presented in Figs. 1, 2, and 3, due to some loss of 
energy to the substrate, the required machining power increases rela- 
tively slowly with film thickness in the range from 200 to 1000 A. 

The power densities required to machine different thicknesses of 
bismuth films with different illumination duration can be estimated 
from Figs. 14 and 15. For the example given at the end of Section IJ, 
the peak power density required to machine a 1000-A film with a 160- 
ns illumination duration, according to Fig. 14, is 4.5 times smaller than 
the peak power density with a 25-ns illumination duration. From Fig. 
15 it is found that the energy density required to machine a 1000-A film 
with a 25-ns pulse duration is 0.085 joule/em?. Assuming that due to 
reflectivity only 50 percent of the laser light is absorbed in the film, the 
peak power density J, is then of the order of 0.5 X 0.085/(4.5 X 25 x 
10°) = 3.8 X 10° watts/cm?. This is in agreement with the theoretical 
prediction presented in Fig. 6 and with the experimental results of Fig. 
9 which indicate that only a small portion of the film is evaporated. 

Back illumination machining experiments were carried out and com- 
pared for Mylar S* and Mylar D* substrates. Both of these materials 
have similar thermal constants. The Mylar S however has a better 
optical quality. The same power densities were required for machining 
in the two cases. As experimentally observed, due to the poorer optical 


* Trademarks of DuPont. 
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Fig. 183—Diffcrent light pulse durations obtained by the modulation system. 
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Fig. 14—Relation between threshold machining powers and illumination duration. 


quality of the Mylar D, the optical beam shape was distorted causing 
a distortion in the shape of the machining. In Fig. 16a, b, and c, ma- 
chining spot shapes are shown and compared for front and back illumi- 
nation on Mylar D substrates and back illumination on Mylar S. As 
seen in Fig. 16a, the spot shape is distorted for the back illumination 
on Mylar D. 

Many bismuth particles in the vicinity of the machining area can 
be observed in Fig. 16. These apparently were displaced as molten 
drops by the vapor pressure during machining. 
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_Fig. 15—Dependence of the energy densities, required to machine 6-ym- 
diameter spots, on the bismuth film thickness. 
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Fig. 16—Machining spot shapes with different substrates for front and back 
illumination: (a) Bismuth on Mylar D, back illumination. (b) Bismuth on Mylar 
D, front illumination. (c) Bismuth on Mylar 8, back illumination. 
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IV. IMAGE RECORDING ON THIN BISMUTH FILM 


The use of laser beams to record information on thin metallic film 
has been demonstrated by several authors.'?:1#1% C, O. Carlson, et al.* 
used a 388-mW CW He-Ne laser to write video information on various 
kinds of thin films on a glass substrate. The thin films used were 500 A 
of lead and tantalum and lym of a triphenylmethane. A negative 
image of a page was formed by micromachining the thin film with the 
laser beam which was modulated at a video frequency. Machining 
lines had a width of 24m with a total image size of 0.24 x 0.24 mm. 
The writing speed was equivalent to about 10° spots per second. The 
energy density required was 1.2 joules/cm?. 

A 100-A-thick bismuth film on a glass substrate has been utilized 
to record holograms of various patterns with a pulsed ruby laser.? 

In the present work, an intracavity modulated argon laser of average 
power capability, 20 mW, has been utilized to record images on a 600- 
A-thick bismuth film on a mylar substrate. The recorded image size 
was 10 by 12 mm and the writing speed was 10° spots per second. Im- 
ages of documents which were originally 8.5 x 11 inches have a resolu- 
tion of over 175 lines/inch when magnified to the original size. The 
picture quality obtained is comparable with a good black and white 
photograph with 8 to 10 shades of gray. The energy density (given by 
the pulse peak energy divided by the spot area) had a maximum value 
of about 0.060 joule/cm’. 

The system is illustrated in Fig. 17. An acousto-optical intracavity 
modulation system was used to obtain 25-ns-duration light pulses from 
a CW argon laser producing 20 mW average power in the Gaussian 
mode. The modulation technique preserved the average output power 
of the laser. The video information was acquired by a He-Ne laser 
flying-spot scanner. A balanced mixer amplitude modulated a 5-mW 
450-MHz RF signal with the video signal. By supplying a de bias in 
addition to the video signal, the modulation depth could be adjusted to 
obtain the proper gray scale. A second balanced mixer, triggered with 
30-ns-duration base band pulses at a 1-MHz repetition rate, was used 
to obtain the RF pulses required to dump the cavity. The video-mod- 
ulated 30-ns-duration RF pulses, after being amplified by a 20-watt 
linear amplifier, were fed directly into the ZnO transducer sputtered on 
the fused silica modulator. As a result of a Bragg interaction between 
the acoustic pulses and the light beam, light pulses with the duration of 
25-30 ns were dumped out of the cavity at a 1-MHz repetition rate. 
The pulses were intensity modulated at the video frequency. The light 
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Fig. 17—The system setup for recording video information by laser machining. 


pulses after being collimated were focused with an f£/5.6 lens onto the 
bismuth film so as to machine 6-y~m-diameter spots. The light pulses 
were deflected in the horizontal direction by a galvanometer mirror. 
The 16-mm mylar substrate with the bismuth coating moved at a 
constant speed in the vertical direction. To prevent the bismuth vapor 
from coating the optical components, the laser beam was incident 
on the thin film through the mylar substrate. The writing speed was 2 
ms per line including 40-percent retrace time. The number of lines 
printed was about 2000 lines per frame (4-second frame with 2.4 x 
10° total number of resolvable spots). Figure 18 shows a photograph 
of a micromachined copy of an [EEE facsimile test chart. As seen in 
this figure, about 8 to 10 shades of gray were obtained. A magnified 
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Fig. 18—Microfilm recording of a facsimile test chart. 


region of an all-white area is shown in Fig. 19. A region of several gray 
scales (a detail of the eye in Fig. 18) is shown in Fig. 20. 

By changing the polarity of the video signal (Fig. 17) either posi- 
tive or negative images could be recorded. In Fig. 21, magnification of 
both positive and negative images of newspaper clippings are shown. 


Vv. CONCLUSION 


Calculations and experiments have been carried out for the machining 
of thin metallic films by a laser beam and for the associated temperature 
rise of the film. For a long illumination duration t, depending on the 
thermal constants of the film and substrate and on the film thickness, 
the temperature to which the film has risen by the end of the light 
pulse is proportional to ~/¢. For a very short illumination duration, 
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-however, the temperature rise approaches a linear dependence on 
time. For longer light pulses, a larger portion of the laser energy is 
lost to the substrate by heat conduction during the pulse. Machining 
of thin films on substrates of very low thermal conductivity is most 
efficiently obtained when the pulse duration is short enough that the 
temperature rise is linearly proportional to the pulse duration. 

With optically transmitting substrates, machining could be obtained 
either by front illumination or by back illumination. The machining 
efficiency is approximately equal for the two methods. In many cases, 
however, back illumination is advantageous since the vapor from the 
surface is directed away from the focusing and deflecting optical com- 
ponents. By varying the laser pulse intensity, the machining spot area 
varies nearly linearly with the pulse intensity over a certain range 
above some threshold value. A better understanding of the dynamics of 
the machining process, however, is still required and further work is 
planned. . 

The use of very short laser pulses permits the machining of images 
of a given sizé and resolution in a given length of time with much 
lower average laser power than would be required with a CW laser. 
By varying the intensity of the laser pulses and hence the area of the 
machined spots, images with a continuous range of shades of gray 
were obtained. Images of printed material having a resolution of about 
175 lines per inch of an original 83 X 11 inch document were machined. 
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The Coupling of Degenerate Modes in 
Two Parallel Dielectric Waveguides 


By D. MARCUSE 
(Manuscript received February 10, 1971) 


Two parallel dielectric waveguides can exchange energy if the field 
carried by one guide reaches the other guide. We consider only the case 
of coupling between degenerate modes of dielectric waveguides (optical 
fibers, etc). Degenerate modes have equal phase velocities, but their 
transverse field distributions need not be identical. 

The coupling theory presented in this paper applies to dielectric 
waveguides of arbitrary shape and arbitrary distribution of refractive 
index. The dielectric media of the guides as well as the surrounding 
medium are allowed to be lossy. The coupling coefficient 1s obtained 
by means of perturbation theory. It is shown that whereas lossless 
degenerate modes can exchange their power completely, lossy modes 
tend to equalize their power. 

The theory is applied to the problem of crosstalk between cladded 
dielectric slab waveguides and cladded optical fibers embedded in a 
lossy medium. 

Since a lossy surrounding medium also causes an increase in the loss 
of the guided modes, formulas for this additional loss are presented. 
It is shown that additional mode loss results even for a lossless sur- 
rounding medium tf n3k > B. (ng = index of surrounding medium, k = 
free space propagation constant, B = propagation constant of guided 
mode.) 


I. INTRODUCTION 


Optical fibers appear attractive as waveguides for the transmission 
of light. Since many such fibers are likely to be bunched together to 
form a cable, the problem of crosstalk between the different fibers of 
the cable arises. There are several sources of crosstalk. Light scattered 
in one fiber may excite a guided mode of an adjacent fiber. This 
double scattering mechanism is discussed in the next paper in this issue. 
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The present paper is concerned with the direct coupling of power be- 
tween two parallel dielectric waveguides. Even though it addresses itself 
to crosstalk in optical fibers the theory is more general and applies to 
dielectric waveguides of arbitrary distribution of the refractive index. 
The modes carried by the two guides need not be identical. However, 
we assume that the phase velocities of the modes in either guide are 
identical. We call modes with identical phase velocity degenerate 
modes. We furthermore admit the possibility that the media of the 
waveguides as well as the surrounding medium in which the guides are 
embedded may be lossy. The restriction to degenerate modes is no 
serious limitation for cases of practical interest. Provided that the 
coupling coefficients are constant, appreciable coupling of power from 
a mode in one guide to a mode in the other guide is possible only if 
the modes have identical phase velocities. 

The coupling theory is based on finding the change of the propaga- 
tion constant of the mode of one guide caused by the presence of the 
other guide. This perturbation theory is directly based on Maxwell’s 
equations and is thus independent of the composition and geometry 
of the coupled waveguides. 

We obtain the coupling coefficient in form of an integral over the 
cross section of the waveguides. The integral is taken over the scalar 
product of the electric field vectors of the two guides. The general 
coupling formula is then used to obtain the coupling between two 
cladded slab waveguides as well as the coupling between two cladded 
round optical fibers. The crosstalk between these two types of wave- 
guides is discussed. The crosstalk can be reduced by inereasing the 
separation between the two guides and by increasing the loss of the 
medium in which the guides are embedded. Coupling between dielectric 
fibers has previously been treated by A. L. Jonest and by R. Van- 
clooster and P. Phariseau.? However, these authors consider only 
uncladded lossless fibers. 


II. COUPLED LINE EQUATIONS 


The general properties of coupled waveguides can be studied with 
the help of coupled line equations. Neglecting the possibility of 
coupling to modes traveling in the opposite direction we can write the 
general coupled line equations of two degenerate modes as follows: 

dA 


= —iBA + ¢,B, (1a) 
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a 2 —78B +A. (1b) 

dz 
The propagation constant @ is assumed to be the same for either 
mode; A is the amplitude of the mode of one guide and B that of the 
mode of the other guide. If each guide can support more than one mode 
these modes would have to be included in the coupled line equations. 
However, it is well known that only modes with equal phase velocity 
exchange a significant amount of energy if they are coupled together 
by a length-independent coupling mechanism. The restriction to only 
two modes is thus an excellent approximation. The coupling coefficients 
Cy, and c. are allowed to be different since the two modes can be 
different even though they are degenerate. However, if the waveguides 
are lossless, conservation of power imposes the condition® 


q = —c%. (2) 


The asterisk indicates complex conjugation. 

In the following analysis we assume that 6 as well as c; and c, may 
be complex quantities since we want to include the case of lossy modes. 
Equation (2) is thus not assumed to hold exactly. Since @ as well as 
Cy and cy, are constants we can immediately solve the coupled line 
equations and obtain solutions in the form 


A(2) = 1 aes + gine) és (2) Bet = ceyheim, (3a) 


Be) — 1 faye + ere) +: (22) Aye **" aa enhanit, (3b) 
1 
with 
AB = iVee . (4) 
The coefficients Ap and By are the field amplitudes A and B at z = 0. 
It is apparent that equations (8a) and (3b) are composed of the 
superposition of two new normal modes with propagation constants 
6, = 6B + AB (5) 
and 
B. = B — AB. (6) 


For complex values of c; and cz we obtain also a complex value of the 
change of the propagation constant Af. In that case, it is clear that 
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the two new normal modes do not only have slightly different propaga- 
tion velocities but also suffer different amounts of loss. 

For the study of crosstalk we need the solution with By = 0. That 
means we assume that at z = 0 only mode A is excited. If we restrict 
ourselves to the case 


| ABz|<«1 (7) 
we find that the ratio of the amplitude B to amplitude A for distances 
z = L for which (7) is valid is given by 


3 
ai = ~ (2) ABL. (8) 


The absolute square of the ratio of B/A is the power ratio at z = L 
in the two guides. Using the letter V for this power ratio we obtain the 
following useful formula for the distance Z at which the power ratio 
in the two guides is V: . 


3 (Vv)? 
[ae | ©) 


For low-loss modes equation (2) must hold at least approximately so 
that we obtain from (9) in the low-loss case 


y? 
~ [AB | 


Cy 
C2 








L (10) 
The low-loss assumption refers only to the actual loss suffered by the 
modes propagating inside of the guides. The loss of the surrounding 
medium in which the guides are embedded can be arbitrarily high. 

Finally we consider two special cases. If we assume that the two 
guides are lossless and assume again By = 0 we obtain from equa- 
tion (3) 


A(z) = Ag cos (Afz)e**’, (11a) 
Be) = - (2) a, sin (ABz)e"***. (11b) 


In the lossless case assumed here Af is real and it is clear that the 
two guides exchange power after a distance D given by 


D= ORE (12) 


If, on the other hand, Af is complex, as it would be in the case of lossy 
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guides, and if we make 2 sufficiently large that the exponential factor 
exp(iA8z) is much less than unity, we obtain again for By = O from 
equation (3) 


A) = 4A,e *"**?* (13a) 


and 


Bi) = 1 Agee (13b) 


Since the absolute value of the ratio c./c, is nearly unity we see that 
the power carried by the two guides equalizes in the lossy case pro- 
vided that we let both modes travel far enough. This power equaliza- 
tion may occur after many power exchange cycles or it may occur 
even before one cycle is completed depending on the magnitude of 
the imaginary part of Af. Of course both modes suffer additional loss 
because of the complex value of the propagation constant 8. The 
magnitude of 8 is much larger than that of Af for this discussion to 
be meaningful. . 

The preceding discussion shows that the crosstalk between the two 
coupled guides is determined by the change in propagation constant 
AB and not by the individual coupling constants c; and c.. It is thus 
sufficient to determine Af. 


Ill. MODE COUPLING THEORY 


We assume that two dielectric waveguides are positioned parallel 
to each other. Each guide is a cylindrical structure with a refractive 
index distribution that is independent of the z coordinate (z is the 
direction parallel to the axis of the structure) but which can have an 
arbitrary dependence on the transverse x and y coordinates. However, 
it is assumed that the index distribution is such that it produces a 
dielectric waveguide capable of supporting guided modes. We assign 
a refractive index distribution n; to guide one assuming that n, has 
the constant value of the refractive index nz; of the background ma- 
terial in the region occupied by guide two. A similar assumption 
applies to the index distribution of guide two. It is assumed to have 
the constant value nz; of the background in the region occupied by 
guide one. The square of the index distributions n; and ny is shown in 
Fig. 1. Since it is the square of the refractive index that enters 
Maxwell’s equations we use the following expression | 


n? = (ni — n3) + (np — 3) +73. (14) 
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Fig. 1—Distribution of the square of the refractive index used to describe the 
two dielectric waveguides. 


Owing to the definitions of n(x, y) and n2(a, y) it is clear that (14) 
becomes n’ = n? in the region of guide one, n’ = n2 in the region of 
guide two, and n? = nz outside the regions of either guide. The re- 
fractive indices n,; , n2 , and nz need not be real quantities but are 
allowed to be complex if the media are lossy. 

The starting point of the perturbation theory is Maxwell’s equations 


VXH = iwen’E | (15a) 
and 
V XE = —twp oH. (15b) 


The time dependence of the fields was assumed to be given as exp (tw). 
The constants ¢) and po are the permittivity and permeability of the 
vacuum and the square of the refractive index is given by (14). Next 
we introduce the perturbation assumption that the electric field E 
and the magnetic field H are very nearly given by the superposition 
of the mode fields of each guide in the absence of the other. We thus 
write . 


E = aE, + bE, +e (16) 
and 
H —= aH, + bH, + h. (17) 


The coefficients a and 6b are being determined by the theory. The 
field E, , H,@ = 1, 2) being a mode of guide v has the z dependence 
exp (—7@z). It is assumed that the propagation constant is the same 
for the modes of either one of the guides in the absence of the other. 
As indicated in Section I, such modes are here called degenerate. The 
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electric and magnetic fields of the unperturbed modes of either guide 
satisfy the equations 


V. XH, — 76.(@ X H,) — iwen2E, = 0 »v = 1,2 (18a) 
and 
V.X E, — i8(2 X E,) + top H, = 0. (18b) 


2 is a unit vector in z direction. The symbol V, indicates the transverse 
part of the V operator. The electric vector e and the magnetic vector h 
appearing in (16) and (17) are assumed to be small perturbations 
that are needed to express the change in the field shape resulting from 
the close proximity of the two guides. In addition to the changes in 
the field shape we must allow for the possibility of a change in the 
propagation constant which can be assumed to be small if the coupling 
of the two guides is sufficiently loose. We thus assume that the new 
fields E and H of (16) and (17) have a z dependence of the form 
exp (—76z) with 


B = Bo + AB. (19) 


We have seen in the last section that the change A of the propagation 
constant determines the coupling of the two guides. It is thus the 
quantity we are most interested in. 

We substitute (16) and (17) into (15a) and (15b) using (14). After 
cancellation of a number of terms with the help of (18a) and (18b) we 
are left with the following system of equations: 


Vi Xb — i8o@ X h) — tweol(ni — 3) + (2 — ms) + nile 
= idp(a X H, + b2 X HL) 
+ tweo[(n2 — n3)aE, + (ni — n3)bE2] — (20) 
and 
V.Xe— iB(é Xe) + twuoh = 1AB(ad XE, +08XE,). (21) 


Products of A@ with e and h have been omitted from (20) and (21) 
since they are small of second order. 


Since we are interested in obtaining A@ from our theory we must 
eliminate the dependence on the spatial coordinates from (20) and 
(21). Concentrating for the moment on the left-hand sides of these 
equations we obtain by scalar multiplication of (20) with E, and of 
(21) with H, and by subtracting the two equations and integration 
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over the entire cross-sectional area 
| B00. Xb = 68, X bh) — tweo((nk — 18) + (05 — nf) + nie] 


— H,-[V, X e — 78,6 X e) + touch]} de dy. (22 


The terms inside of the square brackets are small of first order. We 
now assume that the field intensity of E, in the region of guide two 
has decayed sufficiently to render it also small of first order. Since 
n; — nj is different from zero only in a small area in the vicinity of 
guide two the product of E, with (n} — nj)e is a term that is small 
of second order. In first order of perturbation theory this term can 
be neglected. Performing a partial integration on the terms containing 
the V, operator and rearranging the terms of the mixed products 
allows us to write 


, {h-[V, x E, + 1B (2 x E,) = LwpoH, | 


=e e-[V, x H, ae 1B (2 x H,) ais tween E,]} dx dy. (23) 


Comparison of (23) with (18) shows that this expression would vanish 
if instead of E, and H, we had used the complementary fields Ej and 
H7 that result from the original field if 8) and w are replaced by —£, 
and —w. We have thus shown that by scalar multiplication with E; 
and Hj, subtraction, and integration over the entire cross-sectional 
area the field terms e and h can be made to vanish from (20) and (21). 
A similar procedure can, of course, be carried through using EZ and H,. 
We thus obtain from (20) and (21) the following set of equations: 


al ace. [ (HA, X E, + H, X E}) dr dy 
+ we / (nz — n3)E,-E, dx ay | 


+ we, / (ni — n3)E\-E, dx ay| = 0 (24a) 
and 


ol ae. [ (H, X E, + H, X E;) dx dy 
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+ weo / (ns = n3)E2°E, dx ay | 
+ | ase. | (H2 X E, + H, X E;) dx dy 


+ WEo : (ni = n3)E>:E, dx ay | = (0). (24b) 


‘Equations (24a) and (24b) still contain a number of second-order 
terms. Products of field terms with different indices are small of first 
order since the modes of the two guides are supposed to overlap only 
slightly. Products of field terms with different indices that are mul- 
tiplied by the first-order quantity AG can thus be neglected as being 
of second order. The term (n3 — n3)E;-E, is also of second order since 
it involves the square of the amplitude of E, at the location of the 
opposite guide. Neglecting it and a similar term results in a consistent 
equation system containing only first-order terms. 


ange. [ (H, X E, + H, X E)) dz dy 


Ba ee / Ge aE aay SO ssa) 
and 


Aweo if (nz; — n3)E:E, dx dy 


+ base. [ (Hz XB, -+ H, X Ea) drdy =0. — (25b) 
The equation system (25) allows us to determine the coefficients a 
and b. Since we have a homogeneous system of equations we must 


require that the system determinant vanishes. This latter condition 
leads to a determination of Af. 


AB = =kweo 
| i (ni — n3)E{-E, dx dy : (nz — n3)E2-E, dz dy I 
if 2-(E, X Hi+E; XH) dz dy / é-(E, X Hi+E;z X H,) dx ay| 


(26) 
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The ratio of a/b is of no interest to us so that we do not need to write 
down the solution of (25). 

Equation (26) is the final result of our coupling theory. In the general 
case of lossy dielectric media AG is a complex quantity. Equation (26) 
is very general and holds for any two dielectric waveguides carrying 
degenerate modes. The field expressions E; and H; are obtained by 
changing the sign of 8 and w in the regular expressions for E; and H; . 
In the special case of lossless media E; and H; become simply the 
complex conjugates of the original fields. 


Ep 2B 
H; = H* 


Equation (26) can be simplified in another special case. If both wave- 
guides are identical, carrying the same mode, the two types of integrals 
appearing in the numerator as well as in the denominator of (26) 
become identical for reasons of symmetry. The expression for identical 
modes of two coupled identical waveguides can thus be written more 
simply 


forrealn,; . (27) 


| ci - ndE:-8, de dy 
B= 06 (28) 
[Gx By + BF XH) de dy 


For real refractive indices the denominator of (28) is equal to 4P, P 
being the power carried by the mode of one of the guides. It is neces- 
sary to normalize the mode fields of each guide so that field 1 as well 
as field 2 carry equal power P. This normalization was already 
implied in converting (26) to (28). Even if the medium surrounding 
the two waveguides is lossy it is possible to use the relations (27) and 
the identification of the denominator of (28) as 4P provided that the 
surrounding medium has a very small influence on the mode fields 
carried by each guide. It is thus permissible to use (28) in the form 


WEo 


AG = 4p 


| i - ndES-B, dx dy (29) 
in the special case of lossless dielectric media or, at least approxi- 
mately, even in the case that the surrounding medium is lossy if 
only the fields are so well guided that only a very small amount of 
power reaches the lossy surrounding medium. 
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IV. APPLICATION TO CLADDED SLAB WAVEGUIDE 


The dielectric slab is the simplest type of dielectric waveguide. 
Because of its simplicity much insight into the performance of dielec- 
tric waveguides can be gained by studying the slab waveguide. We 
consider two identical cladded slab waveguides as shown in Fig. 2a. 
Instead of solving the mode problem of the cladded slab waveguide 
exactly, we assume that the cladding is thick enough so that only a 
small amount of power reaches the surrounding medium. This assump- 
tion allows us to obtain approximate field solutions that are much 
simpler than the exact solutions. This case is, moreover, of the most 
practical interest since the cladding is meant to protect the guided 
mode from the disturbing influence of the surroundings. Our approxi- 
mation thus coincides with the case of real practical interest. Without 
going into the details of the calculation we state only the results. The 
field of one of the two guides can be expressed as follows: (For 
simplicity the field of one guide is expressed in the coordinate system 
shown in Fig. 2b.) 


A COS kx Osxrzxzd 
E, = \Be’* + Ce” d<x2x<D 
Fe °* Des roe) . (30) 


The field is continued symmetrically for « < 0. Equation (80) gives 
the H component of the symmetric TE modes of the slab waveguide. 


fom CORE 7 CLADDING 





Fig. 2a—Sketch of two coupled cladded slab waveguides. 
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ed 


oe 
(b) 


Fig. 2b—Sketch of a single slab waveguide used for equation (30). 


The other field components are obtained from E, by the relations 





i Ok, 
ia wg Oz (31) 
and 
t OH, 
one Wo Ox (32) 


The dependence of the field on time ¢ and on the z coordinate is, 
as always, given by the factor 


Pak aes (33) 


which is being omitted from the equations. The parameters appearing 
in (30) are defined as follows: (k = w~W€ouo) 


k = (n2k? — By}, (34) 
y = (6? — n2k’y}, (35) 
p = (8° — n3k’)?. (36) 


Ne, Mm, and ng are the refractive indices of core, cladding, and outer 
medium. The boundary conditions lead to relations between the 
constants A, B, C, and F: 


A K Y— Pp paid goae Daal (37) 


—_ | € 


= Ae’? cosxd, (38) 
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Quy A ; eee TP oN (39) 


~ + a? +7) 
The coefficient A can be related to the power P carried by the mode, 


2wyP 


pa + 8 
Y 


A= (40) 


The coefficient / depends exponentially on the factor —y(D — d). For 
increasing cladding thickness this factor becomes very small. However, 
the factor B vanishes even more rapidly since it depends on the square 
of the same exponential decay factor. This shows that the field inside 
of core and cladding is altered only very slightly if the cladding is 
sufficiently thick. The eigenvalue equation differs from the eigenvalue 
equation for infinite cladding only by a term that also vanishes as 
the square of the aforementioned exponential decay factor. To a good 
approximation we are thus justified to use the eigenvalue equation 
for the infinite cladding guide. 


tankd = ~. (41) 


K 


Once the mode fields are known it is an easy matter to evaluate 
the coupling coefficient (29). We obtain 


Axy = e271 (D-4) —p(R-2D) 

“8 = Ba + ya)y + (+ 7)° : = 
(R is the distance between the core centers of the two slab wave- 
guides.) In case that the cladding material is identical to the sur- 
rounding medium we have p =. y and (42) reduces to the expression for 

coupled slab waveguides to be found in N. S. Kapany’s book. 
The same mechanism that is responsible for the coupling between 
the two slab waveguides also causes loss to the modes in each of the 
guides. Coupling occurs because some of the field in one guide reaches 
the region of the other guide. However, fields reaching out into the 
lossy medium between the guides also cause power loss to the mode 
traveling inside of the guide. Only if the surrounding medium is truly 
lossless can there be no additional loss to the guided mode. I calcu- 
lated the mode loss contributed by the surrounding medium assuming 
that it is infinitely extended and also ignoring the presence of the 
other guide. The power loss can be obtained to a good approximation 
by computing the power flow in transverse direction to the guide 
axis at the boundary between the cladding and the surrounding 
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medium. The validity of this approach hinges again on the fact that 
most of the power is carried inside of core and cladding. The approxi- 
mate calculation leads to the following expression for the power loss 
2a of the mode (a is the amplitude loss coefficient) . 


—2 7 (D-d) 


Sx°y* Im (p)e c 
Bi tye +7) ly+ ef? 
It is assumed that n, and n, are both real; ng is allowed to assume 
complex values. This loss expression has two interesting features. It 
shows that the loss decreases exponentially with increasing cladding 
thickness. It also shows that the loss is dependent only on the imagi- 
nary part Im(p) of p. For p to be real, two conditions must be satis- 
fied. First of all, the refractive index of the surrounding medium, nz, 
must be purely real. This means that the mode suffers loss whenever 
the surrounding medium is lossy. However, even if the surrounding 
medium is lossless, it is still possible for p to be purely imaginary. 
This happens when n3k > £8. In this case there is no total internal 
reflection between the cladding and the surrounding medium and the 
evanescent field tail reaching into the surrounding medium does no 
longer decay exponentially but begins to radiate. Power is thus lost 
from the guided mode even though all the materials of the guide and 
the surrounding medium are lossless. 

Finally, we state the result of the coupling calculation and mode 
loss for the cladded slab waveguide carrying the symmetric TM mode. 
For the details of the field distribution of this mode in an infinite 
cladding guide see Ref. 5. The result of the evaluation of (29) in ue 
case is 


2a = (48) 


2.2 2 2 (y—p) (D—-d) —27(D-d) —p(R-2D) 
TS ES AC cea rr I nA (44a) 
Brn + ney*)yd + nnn + 7')] 
The power loss caused by the lossy surrounding medium is 
Snin;, | nz |x’? Im (2 s)rarene 
2a = us (44b) 


Blyd@i? + iy?) + nnn? + -Y)) | nae + nay PP 
It is apparent that the coupling expression (42) for the TE mode 
differs from that for the TM mode. The coupling as well as the loss, 
thus turn out to be polarization dependent. 


Vv. COUPLING OF THE HE,; MODES OF ROUND CLADDED FIBERS 


In complete analogy to the cladded slab waveguide we now proceed 
to a discussion of coupling between HE; modes in two round cladded 
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Fig. 3—Cross section of two coupled cladded fibers. 


fibers, Fig. 3. Again, we assume that the cladding is sufficiently thick 
to allow us to treat the field inside of core and cladding as being 
affected only very little by the presence of the surrounding medium. 
With this approximation in mind, we state the relevant equations for 
the mode field of a cladded fiber. 

The field inside of the fiber core is described by the set of equa- 
tions® 


IIA 


E, = AJ,() cos “| Gee (45) 


H, = BJ,(kr) sing 
the field inside of the cladding is given by 
E, = (CH; (tyr) + DH,” (iyr)] cos ¢ 
H, = (FH; (ivr) + GH}? (ay7)| sin ¢ 
and the field in the surrounding medium is 
E, = MH! (épr) cos ‘ rere (47) 
H, = NH}? (ipr) sino 


The parameters x, y and p are given by the equations (34), (35), and 
(36). The other field components are obtained from the longitudinal 
components by differentiation 


a<rs<b, (46) 














i (, dE, 1H, 

B= — 2 (e+ am, 4 HE), a9 
i (,1 aE, aH, 

i, = — 2 (pte — on ay (49) 
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_ _1(, dH. of) 
_ i (,10H, 2 Bh, 
Ay, = hg (s r ad + ween ere (51) 


The symbol T stands either for x, zy, or 7p depending on the medium 
in which the field is evaluated. The same is true for n, it represents 
either 2. , Nm, or nj. The boundary conditions establish the following 
relations between the amplitude coefficients: 


BL (#) kan’? {x [2 - Za] 
A \po B(K a Y) k Lea J, (xa) 
Tm | 1 Hy Gao) 
- ya E  #A 1 (tva) ; (52) 


CF _ _Silxa)_ 
A ~ BH Gra)’ (68) 


DYGwrO, (54) 





{Ll 2)—2)— rob Meret 
+4 et — nd) | + 2am (1 —%)h 
{[é (1 = x) = wy'(1 + (na, + 13 xy (55) 


2 
hess 2rky?(n’, + 3 x) + 2untyesr ® (5 —1)¢ 
v= -m 


meee eer iceer 
Lb p p p 


The symbols Jp and J, indicate Bessel functions of zero and first order; 
Ho (tya) and H, (tya) are Hankel functions of zero and first order 
and of the first kind. We use here the notation of a Hankel function 
with imaginary argument used by Jahnke and Emde.’ The coefficients 
M and N given by (55) and (56) do not contain any Hankel functions 
because the approximation for large arguments was used. The relation 
between the power P carried by the HE;; mode in the waveguide and 
the amplitude coefficient A is given by the equation 


(56) 
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Ps x (lib [(ax)*(Jo(ka) + Ji(ka)) — “aise ane 0B) 


a 2 27,2 2 2 

+ 2( 42) B | r tek ey Bo Bo F Mabe” {16 o\(2 Je Ae. (57) 
Eo A K Y 

For sufficiently large cladding radius b the propagation constant of 

the mode can still be obtained from the eigenvalue equation for a rod 

embedded in a medium with refractive index n,, . 


(ee Fees ~ a) tebe — ree (ES i) 


+n “(ya hee - i)} = | ot 2) Bim nae (58) 


This collection of formulas is sufficient to calculate the coupling term 
(29) for the coupling of the HE,, modes in two parallel cladded 
optical fibers. We distinguish two cases. We assume that the modes 
are elther both polarized perpendicular to the plane connecting the 
two fiber core centers or that they are polarized parallel to this plane. 
The coupling is different in the two cases, corresponding to the dif- 
ference in coupling between the TE modes and the TM modes of the 
slab waveguide treated in the preceding section. It must be pointed out 
that a perpendicularly polarized mode does not couple with a horizon- 
tally polarized mode. The two-mode assumption is thus still valid. 

The integral occurring in (29) cannot be solved generally for two 
coupled HE ,,; modes. However, if we assume that the distance R 
separating the two cores is large compared to the core radius “a,” we 
are justified in expanding the expression for the radial coordinate r 
(reaching from the center of guide one to the vicinity of guide two) 
in the following way (Fig. 3), 








= VR? +r? + 2r.R cos 0 YR +1, cos 0. (59) 


The angle @ describes the departure of the direction of rz. from the 
line connecting the core centers, 72 is the distance from the center of 
guide two to the endpoint of r;. When the expansion (59) is used, 
the angular integration can be performed leading to Bessel functions 
of the imaginary argument iprz. The remaining integration over 7 
now involves only products of cylinder functions and can be carried 
out. We obtain the following result for the case of horizontal polariza- 
tion: 
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_ («)' A? M1 (22) me _Is(ca)_ ( e 
0. = (2) a5 rg Ge) oat Bites LOL + Sa) + 9 
Be »/( — fue BY) 
+ by 2 p (p v’) 1 B €0 A 
tH" (tya) 


oe il (e = ae 4 (E a 7) HG TH) 
~ Se + ni(1 — Ef F) rc |. (60) 


For an. case of vertically eae modes we get 


ap, = B81 (ayo 2 [ol) (+ m0 + 27) 
-£ 9 | eo 

YP Hy” ya) 
4. So feoe 1a, ee Flea) 


ma Ble ey (1 5 kes 5) s40]} (61) 


Finally, we een also the expression for the power loss 2a of the 
HE,,; mode caused by the lossy surrounding medium. The calculation 
follows the idea described in connection with the slab waveguide. 


ie (#) era bo a | (e ®) a MNES | 
—k|M [Im (18) — a Lav fim (2)}. (62) 


The coefficients M and N are given by (55) and (56), p, is the real 
part of p, and the symbol Im(_) indicates the imaginary part of the 
expression in parenthesis (or brackets). 














VI. NUMERICAL RESULTS 


To illustrate the consequences of the theory presented in this paper 
a few sample cases have been computed. For comparison purposes 
I have checked the results of my theory against the curves for the 
coupling of uncladded fibers published by Jones.1 The agreement 
between his theory and similar curves obtained from (60) and (61) is 
excellent. 

The amount of coupling between two optical fibers depends on the 
type of mode they are carrying, on the separation between the fiber 
cores, and on the loss of the surrounding medium in which the fibers 
are embedded. Figure 4 shows a set of curves describing the coupling 
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parameter a |A@| as a function of the loss in the surrounding medium. 
The abscissa labeled “loss” measured in db indicates the plane wave 
loss in going through the surrounding medium for a distance R — 26; 
that means it is the loss that is encountered in going perpendicular 
to the fiber axis from the cladding boundary of one fiber to the other. 
There are two curve parameters in Fig. 4. The two sets of curves differ 
by the cladding thickness normalized with respect to the core radius, 
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Fig. 4—Coupling of round fibers. a|Ag| is the coupling parameter of equations 
(60) and (61), the abscissa indicates the loss (in db) that a plane wave would 
experience in going from the boundary of one fiber to the boundary of the other 
fiber. The solid lines hold for horizontal polarization, equation (60), while the 
dotted lines apply to the case of vertical polarization, equation (61). nnka = 16, 
ya == 1.594, ne/rm = 1.01, Re(me) = nm. 
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(6 — a)/a. The coupling is, of course, reduced for increasing cladding 
thickness since the fiber cores are farther removed from each other. 
The second parameter is the separation between the fiber claddings 
normalized with respect to the fiber core radius, (R — 2b) /a. Increas- 
ing values of this parameter again cause the fiber cores to be sepa- 
rated farther from each other. The loss values that form the abscissa 
hold regardless of fiber separation, always indicating the total loss 
that exists between the fiber claddings on their closest approach. The 
curves show clearly the decrease in coupling with increasing loss in 
the surrounding medium. The solid curves apply to the horizontal 
polarization of the two modes and are obtained from (60). The dotted 
curves apply to vertical polarization and were computed from (61). 
Figure 4 holds, of course, only for one particular pair of round optical 
fibers. It was assumed that n,,ka = 16, and that the ratio of core index 
to cladding index is n.-/nm = 1.01. For simplicity, it was furthermore 
assumed that the real part of the index ng is identical to the index nm» 
of the cladding material. The imaginary part of n3 is varied in order 
to obtain the variable loss values of the abscissa. The abscissa can 
be expressed by 8.68 Im (n3)k(R — 2b). 

With the help of (10), Fig. 4 can be used to evaluate the crosstalk 
between the two optical fibers to which the conditions of Fig. 4 
apply. Let us assume that we can tolerate a power ratio V = 10-° at 
the end, z = L, of the two fibers. If we use, for example, a = 1 »m and 
allow L = 1 km we find that we must not exceed 


a|Ag| $10”. (63) 
Any value appearing in Fig. 4 below this line thus leads to acceptable 
crosstalk levels. It is apparent that all curves with (b — a)/a = 10 
are acceptable. For (b — a)/a = 5 a certain amount of loss is re- 
quired to reduce the coupling and consequently the crosstalk below the 
desired level. 

Figure 5 presents the same data as Fig. 4 applied to two identical 
slab waveguides. It was assumed that the refractive indices as well 
as the guide dimensions of the slab waveguides correspond exactly 
to the round optical fibers of Fig. 4. (a = d, b = D). However, the 
two slab waveguides are less tightly coupled than the corresponding 
round fibers. The reason for this behaviour is the difference in the 
decay parameter ya (or yd). For the round fibers, we obtained from 
the eigenvalue equation (58) the value ya = 1.594 while (41) results 
in yd = 1.997 for the slab waveguides. A larger value of the decay 
parameter indicates that exponential decrease of the field amplitude 
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outside of the fiber core is more rapid. The more rapid decrease of the 
field of the slab waveguide leads to less coupling. The solid and dotted 
lines have the same meaning as in Fig. 4. The solid lines apply to the 
TM: mode which is polarized horizontally while the dotted lines apply 
to the vertically polarized TE mode. For simplicity we have assumed 
that the values of yd are identical for the two modes which, strictly 
speaking, is not quite true. However, we see from Fig. 4, as well as 
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Fig. 5—Coupling of slab waveguides. Coupling parameter and loss have the 
same meaning as in Fig. 4. The solid lines indicate the TM mode (horizontal 
polarization) while the dotted lines apply to the TE mode (vertical polarization). 
Nnka = 16, ya = 1.997, n-e/rm = 101. 
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from Fig. 5, that the horizontally polarized modes are coupled more 
tightly than the vertically polarized modes. 

Figure 6 presents a comparison between the coupling strength of 
round fibers and slab waveguides. Since equal geometry does not lead 
to identical values of the decay parameter we have arbitrarily used 
equal values of ya = yd = 1.594 for both types of waveguide. The 
solid lines indicate the results for the round fiber while the dotted 
lines apply to the slab waveguides. It is apparent that the slab wave- 
guides are more tightly coupled when the conditions are such that 
both guides have equal decay parameters. This appears reasonable if 
we consider that the slab waveguides have constant distance from 
each other over an infinite length while the round fibers have a point 
of closest approach so that the total amount of field overlap is less 
in this case. 

The lossy medium does not only serve to reduce the coupling be- 
tween the waveguides but it also has the adverse effect of increasing 
the loss of the modes traveling in the guides. Figure 7 is a plot of 
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Fig. 6—Comparison between the coupling of round fibers and slab waveguides 
(a = d,b = D). The parameter value ya = 1.594 is correct for the round fibers. 
It has also been used for the slab waveguides to enable a realistic comparison. 
nmka = 16, n-/nm = 1.01. 
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mode power loss versus plane wave power loss 2k[Im (ns)a] in the 
surrounding medium obtained from (43), (44b), and (62). Both types 
of waveguides are represented in this figure. The loss values of abscissa 
and ordinate are both expressed in db. The decay parameters ya and 
yd (we use a = d and b = D in Fig. 7) are solutions of the respective 
eigenvalue equations and are not adjusted arbitrarily. We have seen 
that the coupling of the slab waveguides was weaker than the coupling 
of the fibers under these conditions. It is thus not surprising to see 
from Fig. 7 that the losses of the fiber are larger than the slab wave- 
guide losses. It is interesting to see that the symmetric TM mode of 
the slab waveguide is slightly lossier than the symmetric TE mode. 

Assuming again that a = d = 1 pm and limiting the allowable loss 
caused by the lossy surrounding medium to 1 db/km, we find that we 
can accept those curves with losses of less than 2ea = 10-° db. Since 
the loss curves are nearly independent of the actual value of the loss 
in the surrounding medium (after an initial rise from zero mode loss in 
a lossless surrounding medium) we can exclude all curves above the 
10~° line as unacceptable in case of a lossy surrounding medium. We 
thus see that the slab waveguide works well under all conditions shown. 
For the more important round fiber we must exclude the case of a 
cladding with (b — a)/a = 5. This means that the curves of Fig. 4 that 
lead to excessive coupling in the absence of sufficient loss in the sur- 
rounding medium are also unacceptable from the point of view of 
additional mode loss. We see that additional mode loss and mode 
coupling have similar trends. When the coupling between the fibers 
is too large we also have to contend with additional mode loss in 
excess of what we want to tolerate. A guide with low mode loss on 
the other hand is also sufficiently protected by its cladding so that 
coupling between adjacent fibers is not critical. 

The two remaining curves, Figs. 8 and 9, show the coupling param- 
eter and the additional mode loss for the case of a guide with lower 
core-to-cladding ratio. Only the case of the higher coupling for hori- 
zontally polarized modes has been plotted in Fig. 8. The remarks about 
coupling strength and additional mode loss hold true also for these 
conditions as can be seen from the figures. 


VII. CONCLUSIONS 


We have presented a perturbation theory for the coupling between 
two arbitrary dielectric waveguides made of lossy materials and 
embedded in a lossy environment. Only the case of two degenerate 
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Fig. 7—Additional mode loss caused by the lossy surrounding medium. The 
abscissa indicates the loss (in db) that a plane wave would suffer by traveling 
a distance equal to the core radius in the surrounding medium. nnka = 16, 
Ne/Nm = 1.01. 


modes has been treated. The general theory has been applied to cladded 
slab waveguides and cladded round optical waveguides (fibers) with 
lossless cladding material but in the presence of lossy surroundings. 
We have found that the coupling between fibers depends very strongly 
on the separation between the two fiber cores. Far separated cores 
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result in loose coupling. The coupling between the fibers can be re- 
duced by increasing the losses of the surrounding medium. However, 
a lossy surrounding medium introduces losses to the waveguide modes 
to such an extent that the mode losses are unacceptably high if the 
loss in the surrounding medium is necessary to uncouple the fibers. 
From this point of view it appears as though losses in the surrounding 
medium are unsuitable to achieve uncoupling or reduction of cross- 
talk between optical fibers. However, the coupling mechanism dis- 
cussed in this paper is not the only contributor to crosstalk. Imperfec- 
tions in the fibers cause light scattering that can also be a source of 
crosstalk. This mechanism is less strongly dependent on the separa- 
tion of the fiber cores so that it may not be possible to uncouple fibers 
sufficiently simply by protecting them with a cladding of sufficient 
thickness. 


19-20 


10-22 





10-24 


0 20 40 60 80 100 
LOSS IN DECIBELS 


Fig. 8—Coupling of round fibers. Only the case of horizontal polarization is 
shown. tmka = 21, ya = 0.857, ne/nm = 1.003. 
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Fig. 9—Additional mode loss caused by the lossy surrounding medium. nmmka = 


21, Ne/Nm = 1.003. 
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Crosstalk Caused by Scattering in 
Slab Waveguides 


By D. MARCUSE 
(Manuscript received March 1, 1971) 


Light scattered in one optical fiber can be scattered back into a 
guided mode in a neighboring optical fiber. This type of scattering 
crosstalk is investigated in this paper for slab waveguides. However, 
the results for the slab waveguide case are upper limits on the crosstalk 
between round optical fibers. Scattering crosstalk 1s expressed in terms 
of guided mode radiation loss whose cause is the same scattering 
mechanism. It is thus not necessary to know the details of the scatter- 
ing mechanism as long as the scattering loss, which is a measurable 
quantity, is known. In addition to the total radiation loss the cross- 
talk depends on the width of the radiation lobe in which the scattered 
energy escapes from the waveguide. The crosstalk is inversely pro- 
portional to the width of the radiation lobe and thus is larger when 
the radiation lobe 1s narrow. Scattering crosstalk can become serious if 
both guides have a systematic sinusoidal imperfection of the same 
mechamecal frequency resulting in a radiation lobe of nearly zero 
width. In the absence of a systematic sinusoidal imperfection it can 
be concluded that scattering crosstalk between dielectric waveguides 
(optical fibers) 1s negligible if the radiation losses are tolerable. In 
this case it appears unnecessary to suppress crosstalk by means of a 
lossy medium between the waveguides. 


I. INTRODUCTION 


In an earlier paper? I discussed the crosstalk between optical dielec- 
tric waveguides based on the directional coupler mechanism. In that 
case, crosstalk was caused by the fact that the exponentially decaying 
field of one guide reaches the region of a neighboring guide. This type 
of crosstalk exists even if both guides can be considered to be perfect 
dielectric cylinders. 

The present paper is devoted to a crosstalk mechanism of a different 
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type. A dielectric waveguide or optical fiber loses power by radiation 
if the guiding structure is imperfect in any way. This imperfection 
may consist of deviations in the perfect cylindrical geometry of the 
core-cladding interface or it may consist in random variations of the 
refractive index distribution. Some of the radiation, causing loss to 
the guided mode of one fiber, may reach a neighboring fiber and can 
there be scattered back into one of its guided modes. This type of 
crosstalk is thus caused by waveguide imperfections. However, just as 
in the case of the directional coupler effect, it is possible to link pre- 
dictions of the effectiveness of this type of crosstalk mechanism to 
the loss it causes to the guided mode. This makes it possible to give 
very simple rules that link crosstalk to radiation loss (see equation 
(50)). It turns out that scattering crosstalk is no serious problem 
provided that the radiation loss of the guided mode (that is caused 
by the same mechanism) remains within acceptable limits. 

Our treatment of the scattering crosstalk problem is simplified by 
limiting it to the case of the slab waveguide. It is intuitively clear 
that the crosstalk between slab waveguides must be stronger than 
crosstalk between round optical fibers. In fact, the crosstalk between 
parallel slab waveguides is independent of their separation while the 
crosstalk between round fibers must vary inversely proportional to 
their distance. Furthermore, our treatment of crosstalk starts out by 
assuming a definite scattering mechanism. That is, we assume that 
the core of either guide varies in width. However, this special assump- 
tion allows us to describe the more general case. No matter what the 
mechanism, scattcring of light can be attributed to the individual 
Fourier components of either the core width variations or of the de- 
parture of any other quantity from its perfect value. Regardless of 
the particular mechanism each Fourier component causes a narrow 
radiation lobe to escape in a definite direction. The same mechanism 
that produced the radiation lobe in one guide is responsible for the 
capture of some of this radiation in the neighboring guide. It is thus 
immaterial by what mechanism the radiation and reconversion was 
produced. In particular, if we express the strength of the scattering 
mechanism by the radiation loss that it causes to the guided mode, 
we have a description of the crosstalk effect that 1s independent of 
the actual scattering mechanism. Realization of these few basic facts 
greatly simplifies the treatment of the scattering crosstalk problem. 

The deterministic scattering theory can easily be extended to include 
statistical assumptions about the scattering mechanism. We are thus 
able to express the scattering crosstalk in terms of three parameters: 
the scattering mode loss, the ratio of waveguide length to free space 
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wavelength of the guided radiation, and the width of the radiation 
lobe. 


II. CROSSTALK CAUSED BY SINUSOIDAL CORE THICKNESS VARIATION 


As discussed in the introduction, we are basing the treatment of 
scattering crosstalk on the mechanism of core width variations. To 
begin with, we concentrate on crosstalk caused by sinusoidal varia- 
tions of the core of both slab waveguides. The more general case of 
arbitrary core width variations can be treated by decomposing the 
arbitrary functions into Fourier series and utilizing the result for a 
single sine wave component. Finally, we express the crosstalk in terms 
of radiation losses suffered by each guided mode and are thus able to 
free ourselves of the particular scattering mechanism. 

It is well known that dielectric waveguides possess two types of 
modes. At a given operating frequency there are a finite number of 
guided modes and a continuum of radiation modes. The coupling 
between these two types of modes caused by variations of the core 
thickness has been explored in earlier papers.?* We can thus simply 
draw on these earlier results for our present purposes. 

The even and odd radiation modes of the dielectric slab waveguide 
[equations (24) and (25) of Ref. 3] can be expressed in the 
following simple form outside of the core, x, > d (Fig. 1): 








€ 3 
6,°(p) = (7?) cos [p(| 2, | — d) + ¢le™, (1) 
Bie) = 72 (EY cos fall I- D+ Me™ 


As usual, the factor exp(+ tot) has been suppressed. The phases 
appearing in (1) and (2) are given by 


tan ¢@ = 7 tan od (3) 
and 
tan y = a cot od, (4) 
with (k = wVeu,) 
p = (nzk° — B°)} (5) 


and 
o = (nik — p’)*. (6) 
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Fig. 1—Geometry of the two slab waveguides. The figure shows the definition 
of several parameters and indicates a scattered light beam (as a ray) going from 
guide 1 to guide 2. 


The slab half-width is d; n and Ne are the refractive indices of core 
and cladding. The power carried by the radiation mode is defined by 
the orthogonality condition of the radiation modes: 


Fe [a @)8i(o!) da = Pa, (0 — 6’). (7) 
WU d-«0 | 

8¢) 1s the Kronecker delta symbol while 8 (p — p’) is Dirac’s delta func- 
tion. An even guided TE mode is coupled to only even radiation modes 
by a sinusoidal variation of the core thickness of the form 

dy =d + ay sin 6,2. (8) 
According to Ref. 3 the radiation field excited by the sinusoidal thick- 
ness variation of the slab core is given by 


_ ayk?(Qap.P) (ni — nz) COs Kd 





E, = ; 
(6,4 ae és) 
v1 
; sin A= 
ae cos od cos [p(2 Par d) + o| er iB A/2)2 2 dA (9) 
b-Bi-naz {p COS od + a’ sin’ cd}? A 
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This formula holds for d < 2; < o. The variable A is given by 


A= 6, —- (B1 = 6). (10) 
The parameter @,; is the propagation constant of the even guided TE 
mode. The other parameters appearing in (9) are 


kK, = (n2k? — By} (11) 
and 


v1 = (Bf — nigh’)? (12) 
There are two differences between (9) and equation (27) of Ref. 3. 
We are now dealing with core thickness variations instead of a varia- 
tion of only one of the faces of the slab core. The guide length L had to 
be replaced with the length coordinate z since we assume that the 
guide is very long and that we are looking at a field close to the fiber 
core and not at z > L infinitely far from the sinusoidally perturbed 
section of length L. Finally, we have replaced the differential dp of 
equation (27), Ref. 3 by 


ine dA. (13) 


For large values of z the factor (1/7) (sin Az/2)/A approaches a delta 
function. We are thus permitted to take slowly varying functions out 
of the integral and are left with 


“ sin As 
Lim | cos [p(2, _ d) + dle" = ak 


= 5exp {-ilB + a. —d)+¢)}. (14) 
The limits on the integral were extended from —o to +o. This 
does not change its value since only the immediate vicinity of A = 0 
gives a contribution. The dependence of p and 6 on A follows from 
(5) and (10). The propagation constant 8 is given by 8 = B) — 6, (or 
A = 0). It is more convenient, however, to express it in terms of the 
angle a at which the radiation is leaving the slab core.* We have — 


B = nk cos @ (15) 
and, similarly, | 


p= nk sin a. (16) 
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The. radiation field that is caused by scattering of light from the 
even TE guided mode by a sinusoidal variation of the core thickness 
is thus simply a plane wave. 


Pa a,k?(Quu,P)?(n; — n2) cos x,d cos ed giFt-o o-ilBesFen) (47) 


of (0 ae BN cos’ éd + é’sin a} 


a 
2 


The phase terms ¢ is given by (3), the parameter ¢ follows from (6) 
by a replacement of 8 with B of (15). 

This calculation proves once more that the standing-wave radiation 
modes result in traveling waves when properly superimposed. 

We have so far ignored the finite width of the cladding. We could 
easily take reflections and refraction at the cladding boundary into 
account. However, our theory is meant as an order-of-magnitude 
estimate of crosstalk so that the small reflection losses at the cladding 
boundary will simply be ignored. 

As the radiation reaches the core of the neighboring slab waveguide, 
reflection and refraction are taking place. It is intuitively clear that 
the plane wave impinging on the core of the neighboring slab wave- 
guide can be expressed as a superposition of an even and an odd 
radiation mode of the guide. In fact, it can be shown that the plane 
wave impinging on the second guide can be expressed by 

BE, = Gfe"*8,(6) — e *8,()}, (18) 


with the even and odd radiation modes (1) and (2) being expressed 
in a coordinate system centered at the core of the second guide. The 
coefficient is given by 


-i¢ a,k? (a8)? (n? =. n>) cos Kya COs od 7 tF(R-24) 


(es 
24(s.a + Bs Ng cos’ éd + &* sin? ai} 


; (19) 


2 


R is the distance between the centers of the two waveguides. We now 
know how the radiation originating at the first guide excites the radia- 
tion modes of the second guide. If we assume that the second guide 
also has no other imperfections than thickness changes of the core 
and if we are concerned with the excitation of another even TE mode, 
only even radiation modes of the second guide can contribute to the 
crosstalk problem. 

The question of how an even radiation mode excites an even guided 
TE mode has been solved (in principle) in Ref. 2. The excitation of 
the guided mode is expressed by an excitation coefficient c(z). How- 
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ever, we are interested only in the value of c at z = L. To the first 
order of perturbation theory we obtain 

c(L) — fe RRCP-2) g—tpar(R—-2D) ,— aa (nak/par) (R—-2D), ~ 246 


a,dopk*(n} — n3)? cos xd COs kod cos’ ade **” 


$ 
2 (6. + 51) 6a + #)} (p’ cos’ cd + &’ sin’ Gd) 





L = 
f pee (sin iu ote-P dz. (20) 
0 


Several new parameters have been introduced in (20) (see Fig. 1). 
FR is the separation between the core centers of the two guides. The 
first exponential factor of (20) accounts for the phase shift of the plane 
wave inside the claddings of the two guides with cladding thickness 
D — d. The second exponential factor accounts for the phase shift 
in the medium of index n; between the two guides (separation R — 2D). 
The value of 5 in the medium between the guides was designated by 
ps, (see Fig. 1). The subscript r indicates the real part of p; . The imag- 
inary part of p; attenuates the plane wave as it traverses the space 
between the two guides. We have assumed that the core and cladding 
of both guides are lossless but allow for the possibility that the medium 
separating the two guides may be lossy. The loss suffered by the plane 
wave in the medium between the guides is given by the third exponential 
function in (20). The real part of the refractive index of the medium 
between the guides is n3; , the amplitude loss coefficient of the wave 
is a3 . The ratio n3k/ps, adjusts for the slant angle of the wave and 
R — 2D is the separation of the two guides (cladding-to-cladding 
distance). The subscripts 1 and 2 refer to quantities belonging to 
guide 1 and 2. Finally, we have added the amplitude loss coefficients 
a, and a, to account for the losses suffered by each guided mode. These 
losses can be thought of as the sum of heat losses in each guide plus 
radiation losses caused by the variation of the core thickness. The 
factors a, and a, indicate the amplitudes of the sinusoidal thickness 
variation of the cores of each guide. However, it is assumed that the 
perfect geometry and the refractive indices of both guides are identical. 
The mechanical frequencies 6, and 6, are not independent of each 
other. Only when the propagation constant 6. of the guided mode of 
the second guide and the mechanical frequency 6, of the thickness 
variation of its core are related to each other by the equation 


6. = Bo — B (21) 


can any amount of power be coupled from one guide to the other. 
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From (10) (with 6 = 6 and A = 0) and (21) we obtain the condition 


6, = 6, — (81 = B2). (22) 
The range of 6, values contributing to radiation is given by® 
Bi — nk < 0 < Bi + nok. (23) 


In the more general case of a Fourier spectrum of mechanical fre- 
quencies of the core thickness variation, we see that each Fourier 
component of guide 2 combines with a certain Fourier component of 
guide 1 to provide crosstalk. 


Ill. GENERAL CORE THICKNESS VARIATION 


Having determined the amplitude coefficient of the guided mode in 
guide 2 that is excited by purely sinusoidal variations of the thickness 
of the fiber cores we can immediately generalize to the case of arbi- 
trary core thickness variation. We expand the core thickness of each 
guide in a Fourier series: 


4 © 
d,=d+ (2) >> ay, sin 6,2, (24) 
v=] 
a < 
d,=d+ (2) >> dy, sin 4,2, (25) 
B=1 


with 6, = vz/L. We choose the Fourier sine series because we already 
know the result for a single sinusoidal thickness variation. The ex- 
pansions (24) and (25) involve a complete set of orthogonal functions. 
By replacing 


a,— (2) a F (26) 


a, — 2 ei. (27) 


and summing over v we convert (20) to the general case. Only one 
summation is required since each Fourier component of guide 1 com- 
bines with only one definite Fourier component of guide 2. However, 
before writing down the resulting formula we will make two more 
changes. Furst, we can convert the sum to an integral with the help of 
the relation 


and 


Doe fas, (28) 
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Second, we introduce the Fourier coefficient (6) that was used to 
express the radiation losses in Ref. 3. By definition we have 


(0) = + I @,—de™ de 5 =1,2. (29) 


The relation between ¢;(6) and aj, is given by the following equation 


66) = Im 6G) =—-gprar F=L2 CO 


The notation Im indicates the imaginary part. Then we form the ratio 
of power at the end of guide 2, P,(L), to the power at the end of guide 1, 
P,(L) = es 


Bin A tele Ete = PPh lo |? (31) 


With the help of (21) the integral in (20) can easily be performed 
and we finally obtain 


PL) _ Lift = 3)"b* cos" md cos" nade ** (2 = —y 
P\(L) 
— w'(ed + © \(s0 + &) 


Nee ob: (4; Jobo" (4.)B COs éd 
fi-nak Bp Cos Gd + 6’ sin’ ad 





QA, — QA 





2 
| 
Equation (32) is the expression for the crosstalk between two identical 
slab waveguides. The coupling mechanism is the thickness variation 
of the two waveguide cores. The even guided TE modes in either 
waveguide need not be the same. 

In its general form the crosstalk ratio is not very useful since the 
spectral functions ¢; and ¢2 are usually not known. However, it is 
possible to provide estimates of the crosstalk by using the radiation 
loss of the guided mode that is caused by the same mechanism that 
provides the crosstalk. The scattering loss was given in Ref. 3 
(e=. 1, 2): 


ee ae L(ny — n3)"k* cos” xd 
na.) ale B, 
(6.4 a a 


nek 2 4 
| Lote) | pcos od ge gg 


-nak P COS’ od + co’ sin’ od 





ig tg * {26(D—d) +03r(R-2D) poe al paki par) (R-2D) dé, 


(32) 
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[There is an error in equation (14a), Ref. 3. The left-hand side 
should read APd/(PL).] Equation (33) differs from equation (14) 
of Ref. 3 since we have here assumed that the core thickness varies 
while in Ref: 3 it was assumed that only one side of the core cladding 
interface is distorted. 

For simplicity we consider now the case that the two modes of the 
coupled guides are identical and that only one sine wave component 
of the Fourier spectrum exists. We thus have (¢1 = ¢2): 


| 6:(0) |? = |b (OP = oF a 8(0 — 6”) (34a) 
and 
(ar? (@)!? = 41°(0)6:° (8) = Fe a’ 60 — 0), (34b) 
and obtain from (32) and (33) with a, = ae 
P,(L) = 2 —2as(nsk/psr) (R-2D) ; 
P,(L) (L) > (1/4) (2a,L)°e : (35) 


The index r is meant to indicate that a, is the radiation loss. In this 
special case the crosstalk is simply proportional to the square of the 
mode power loss coefficient 2a,L times the power loss that the plane 
wave suffers in going from guide 1 to guide 2 through the lossy medium 
between the guides. 


IV. RANDOM VARIATIONS OF THE CORE THICKNESS 


The average radiation loss is simply obtained by replacing |¢;|? 
with the ensemble average (|di|?) in (33). Obtaining the ensemble 
average of (32) is a little more complicated. To simplify the discussion 
we write the integral expression occurring in (32) in the following 
form 


Bitnak 2 
1=| [606° OFO a0 





FL o!O)60)F@) |, 6) 
with 
0, = 6, — (6, — Br). (37) 


With the help of (30) we can also write for the ensemble average of I 
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2 
(I) as 7 Di (0 Oi, 02,034) F (0,)F*(8,-). (38) 


The asterisk indicates complex conjugation. It appears reasonable to 
assume that there is no correlation between guide 1 and 2. We thus 
assume that we can write 


(a, ,0*, A2,a%,-) = (a,,a%,-)(a2,0%,"). (39) 
It is shown in Ref. 4 that for a stationary random process with 
[I — © we have 


(a1a%,") = { [ary |’)8,.-° (40) 

[The proof presented in Ref. 4 for the usual complex Fourier series 

can easily be extended to the Fourier series (24). The conclusion (40) 
remains correct. | 
We thus have 


= Fi Law PML as PY 1 FCO P (a1 


Returning to the integral notation we have 


Bitnak 
CQ) =F J, M66) PL 3°62) FP) 1 CG) Pas. (42) 


For stationary random processes we can assume (|¢")(6)|?) = 
VY (|$(8)|?). We thus obtain the ensemble average of the crosstalk 
in the form: 


(P2(L)/P.(Z)) 
(2a, L)(20,2L) 


giercena(] _ a) } a {| 1 (6;) i x $2( 62) i se “nak cos* od er" (nsk/par) (R—-2d) da 
T De dG (p° cos® od + &* sin’ ed)? ; 


~ AngkL c _A.44(8) Fo cos" od ie (| 42(6) [?)p* cos’ od_ 4, 


p cos’ od + o’ sin’ cd p cos’ cd + o’ sin’ od 





a, — a2 














(43) 
We used (15), (16), and (10) which, with A = 0, assumes the form 
0; = By a B = By r— rake COs @ (44) 
to obtain 
dé, = nk sin ada = pda 


for the integral in the numerator of (43). The angle a indicates the 
direction in which the plane wave radiation from a given Fourier 
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component escapes from the waveguide core. (The angle a must not 
be confused with the loss coefficients a; or a, .) In the integrals in the 
denominator we have likewise used the transformation 


B = nk cos a, (45) 
so that we obtain from (5) 
p = (n2k? — B’)* = nok sina (46) 
and ; 
dB = —pda. (47) 


For a discussion of (43) it is much more convenient to consider the 
case that mode 1 of guide 1 and mode 2 of guide 2 are identical. The 
conclusions for the more general case are very similar. We can further- 
more assume that the power spectra of the two guides are identical 
since they are supposed to be statistically similar, 


( 611?) = ([b2 |?) = (16 |’). (48) 
We thus obtain the much simplier formula (For simplicity we use 
az = 0) 


(P2(L)/P,(L)) es o (p cos’ od + o’ sin’ od)? 
Qa,L) = Inkl f he@ Die eot ca.) 
o p cos od + ao’ sin’ ad 


(49) 


V. DISCUSSION OF THE SCATTERING CROSSTALK 


The crosstalk for identical mcdes, equation (49), can be expressed 
approximately by very simple formulas. Let us assume that the 
Fourier spectrum of the core thickness variation is so narrow that only 
a narrow beam of radiation leaves guide 1. The radiation is centered 
around an angle a and fills a range Aa. Provided this range of angles 
is sufficiently narrow and if we also assume that the Fourier spectrum 
is constant in the region contributing radiation and zero outside of 
this region, we can write (49) in the following simple form. 








P,(L) _ | v 1 2 -2a3(R-2D)/sina 
(Fa ARE (Ao aia 20) 

(The angle a and the loss coefficients a, and a3 must not be con- 
fused.) We have reinserted the loss in the medium between the two 
guides in (50). We thus have the interesting result that the crosstalk 
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depends on the square of the loss coefficient of the guided mode, on the 
loss in the medium between the waveguides, and on the width of the 
radiation lobe. The crosstalk decreases with increasing lobe width. 
In fact, by comparing (50) with (35) we see that we can let Ae become 
as small as 


T 
ein nokL sina’ (51) 
because in this limit (50) applies to a single plane wave coupling 
the two guides together. 
In the other extreme it can be shown that (49) can be approximated 
quite well by the expression 


P,(L) Beeps 72 

(Fe) =o 4n kL (2a,L),, (52) 
if we assume that (|¢|?) = const, which corresponds to the case of 
random scattering of radiation in all directions. We assumed a3 = 0 
in (52). The error between the approximation (52) and the equation 
(49) is always less than a factor of two. This is not a bad approxima- 
tion for an order-of-magnitude estimate. It is interesting to note that 
we obtain (52) formally from (50) by taking 





Aasina = s (53) 
Since it is certainly reasonable to assign Aw = 7z to the case of 


isotropic radiation, we see that the crude approximation (50), which 
holds precisely only when Aa@ is very small, can actually be used to 
estimate the scattering crosstalk for all cases of interest. 

It is furthermore important to note that the model of scattering 
used to derive our equations is very likely to be of no importance to 
the results. Since we could express the crosstalk in terms of the radia- 
tion loss of the guided mode and some features of the radiation 
spectrum we can be confident that our equation (50) holds at least 
to order of magnitude for any type of waveguide imperfection that 
causes radiation. 

If we ask ourselves what relation the slab waveguide theory may 
have to the case of round optical fibers we can only draw some very 
general conclusions. The plane wave radiation produced by the slab 
waveguide does not diminish with distance. It is thus not surprising 
that the crosstalk formula (50) is independent of distance except for 
the obvious distance dependence of the loss term containing a3. The 
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radiation of a round optical fiber has the form of cylindrical waves 
so that we must expect that, in addition to the features expressed by 
(50), there will be an additional factor d/R multiplying the cross- 
talk expression. It is hard to say whether this adjustment will yield 
an expression that describes the crosstalk for round fibers to a fair 
approximation. However, it appears safe to assume that the crosstalk 
predicted by the slab waveguide theory is actually larger than the 
crosstalk for round optical fibers. We can thus consider the results 
of this paper as an upper bound of the crosstalk of round optical 
fibers. Any design features that are predicted on the basis of this 
theory are likely to be conservative if applied to the round fiber case. 

We conclude our discussion with a numerical example. Let us 
assume that the length of the guides is L = 1 km. Taking d = 1 pm, 
we have L/d = 10°. Next, we assume that n.kd = 10 and that the 
radiation loss of the waveguides is 4 dB/km corresponding to 2a, = 1. 
In the absence of loss in the medium between the two guides (a3 = 0) 
we thus obtain from (52) 





P(Z) -10 
(7 = 8/4 X 10. (54) 
The crosstalk is thus insignificant even without trying to isolate the 
guides from each other by providing loss in the medium between the 
guides. This example is actually quite representative. The assumed 
mode loss is typical for Rayleigh scattering losses in good solid 
materials. Since Rayleigh scattering is the theoretical limit of scat- 
tering loss, one should consider the crosstalk caused by this mechanism. 
Our conclusion is thus that Rayleigh scattering does not cause appre- 
ciable crosstalk between two optical fibers. The situation worsens 
somewhat if the radiation is bunched. However, we can stand a 
much smaller value of Aa than Aw = =z before crosstalk becomes a 
problem. It appears that the only real danger is the possibility of a 
pure sinusoidal thickness variation of the fiber core such as may be 
caused by a systematic flaw in the fiber pulling process. Such an 
imperfection can cause serious crosstalk even if the radiation loss 
attributable to this mechanism is quite small. We see from (385) that 
a purely sinusoidal distortion of the waveguide core causing only 
2a,L = 10 can cause a crosstalk ratio of 2.6 x 10°. The theory of 
Reference 2 predicts that a loss of 2a,l, = 10% is caused by a sin- 
usoidal core thickness variation with an amplitude as small as = 10° 
pm. 
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VI. CONCLUSIONS 


Using the model of two slab waveguides with core thickness 
variations we have computed the crosstalk that is caused by light 
scattering in one guide and reconversion of this scattered light in the 
other guide. Since the results of the theory can be expressed in terms 
of the radiation loss that is caused by the same scattering mechanism, 
it is assumed that the theory has general validity. It is also concluded 
that the crosstalk of round optical fibers is smaller than that of two 
slab waveguides so that the theory can be considered as an upper 
bound on the crosstalk between fibers. 

Scattering crosstalk can be substantial if both waveguides have a 
sinusoidal imperfection of equal mechanical frequency that persists 
throughout the entire length of the guides. If such an imperfection 
should exist, it would be necessary to isolate the waveguides from 
each other by providing a surrounding medium with loss. However, 
it appears unlikely that such a systematic sinusoidal deformation 
should exist since it would require that a definite mechanical period 
with a frequency inside of the range given by (23) should have been 
generated by the manufacturing process. 

Any other imperfections that are of a statistical nature are much 
less serious even though they might lead to radiation patterns with 
narrow radiation lobes. It appears likely that scattering crosstalk 
will not present a problem particularly since the radiation losses of the 
waveguides must be kept low in order for the guides to be useful. Un- 
less a definite sinusoidal imperfection does exist it seems unnecessary 
to provide loss in the medium separating the two waveguides in order 
to reduce the scattering crosstalk. 
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Dynamic Channel Assignment in 
High-Capacity Mobile Communications 
Systems 


By D. C. COX and D. O. REUDINK 
(Manuscript received January 4, 1971) 


Computer simulations of high-capacity mobile radio systems using 
different channel assignment philosophies are described. These simula- 
tions initiate call attempts and move vehicles about randomly accord- 
ung to prescribed statistical distributions. Base stations and radio 
channels are assigned to serve mobiles and system operating statistics 
are accumulated. Relationships between systems parameters obtained 
from the simulation are presented. Performance of a dynamic channel 
assignment system (DYNSyS) which has all channels available at all 
base stations is compared with performance of a fixed-channel assign- 
ment system. (FIXSYS) which reserves channel subsets for use at speci- 
fic base stations. 

For uniform. spatial distributions of call attempts and 40-channel 
systems with reuse intervals of four base station radio coverage areas, 
the pynsys outperforms the rrxsys at blocking rates up to 13 percent. 
For example, at a 3 percent blocking rate the pynsys provides 20 per- 
cent more calls “on” in the system. 


I. INTRODUCTION 


High-capacity mobile radio systems with large numbers of radio 
channels have been proposed to relieve the over-crowded conditions 
existing in mobile radio communications today. In order to make effi- 
cient use of frequency spectrum, radio coverage from base stations 
would be limited to small zones and channels would be reused several 
times within a particular urban area.+~* Figure 1 depicts such a chan- 
nel-reuse situation along a single line of base stations. Computer con- 
trol of such systems certainly would be necessary. Bounds on the 
behavior of simplified systems have been obtained previously? but the 
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Fig. 1—Illustration of channel reuse. 


problem of determining the performance of these complex systems by 
rigorous analytical methods does not appear tractable. Since little is 
known about the overall performance of dynamic channel assignment 
systems computer simulations were run for postulated systems whose 
parameters were chosen somewhat arbitrarily; however, these param- 
eters (e.g., coverage cell size, call durations, number of channels, veloci- 
ties, etc.) were selected within the wide range of values which might be 
experienced in an actual system. This paper presents some performance 
characteristics of two dynamic mobile radio systems obtained by 
simulation on a high-speed digital computer. This provides a means for 
comparing two basically different system disciplines. [t should be 
noted that more work is required to simulate performance charac- 
teristics for actual system configurations designed to operate optimally 
within a specific set of conditions. No consideration has been given in 
this study to either the time required to develop the different system 
configurations or the economic factors necessary for final system 
design. 

Channel assignment in the first system is made on a dynamic basis, 
that is, a channel is assigned to a requesting vehicle and its covering 
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base station by a method which considers the channels in use at that 
instant at that base station and at base stations within some region 
surrounding the vehicle. Once a channel is assigned it will generally 
stay with the vehicle for the duration of the call. This may require the 
assignment of other base stations to that channel to cover the vehicle 
as it moves about. 

Channel assignment in the second system is made on the basis that 
only a fixed subset of the channels available to the radio system are 
available for use at a given base station. The essential difference be- 
tween these systems is illustrated in Fig. 2 where M-N indicates the 
channels available at each base station for a system example with 15 
channels and a reuse interval of every third base station. 

The first part of the paper describes a computer simulation of both 
systems. The second part presents performance characteristics for two 
different spatial-demand profiles for 40-channel systems. Throughout 
the paper a channel is defined as a two-way duplex radio channel. 


II. OVERALL MOBILE RADIO SYSTEM OPERATION 


This section describes briefly the functions which must be performed 
in the operation and control of a high-capacity mobile radio system 
and indicates which portions of the system are simulated. The major 
characteristics of a system are illustrated in Fig. 3. The box labeled 
“call initiation and vehicle motion” may be regarded as the driving 
function. A radio system must react to the situations created by this 
driver. These situations involve such parameters as the rate of occur- 
rence and the distribution of locations of new call-attempts. Because 
coverage areas will be crossed as vehicles move, direction of travel and 
speed distributions are other influencing parameters controlled by the 
driver. 

Since the radio path is usually not line-of-sight, propagation effects 
must be taken into account. The goals in this case are to assign the 
best base station or base stations to provide good service to the mobile 
and to make efficient use of the radio spectrum. The way of dealing 
with propagation effects can be quite varied and is closely tied to the 
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Fig. 2—Fixed-channel assignment system example. 
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philosophy under which base station assignments are made. One ap- 
proach could be to attempt to force propagation effects to have a mini- 
mal influence by restricting the use of signal from each transmitter out 
to a fixed radius and then providing a margin to handle the statistical 
fluctuations. This would produce strong signals beyond that radius in 
some regions and thus would be bad from the standpoint of frequency 
reuse. Another extreme would be to have stored the exact coverage 
areas from every base station. In this case all propagation effects are 
known. The problem here is that of being able to store, retrieve and 
make intelligent decisions based on the information in a short period 
of time. The criterion for selection of the proper base station to serve 
a mobile is to provide the maximum channel reuse consistent with good 
service. 

When a mobile requests service, a channel must be selected to serve 
it from the selected base station. Choosing the most distant channel 
currently being used may not always be the proper choice. This ap- 
proach may minimize cochannel interference problems, but would be 
poor from the spectrum reuse point of view. 

Since many of the important performance parameters of a mobile 
radio system cannot be obtained analytically and competing systems 
would be far too costly to build up in order to make the comparisons, 
computer simulation can be a powerful tool in providing answers for 
system design. At this stage the simulation to be described has sim- 
plified some of the operations illustrated in Fig. 3, but was written 
with the flexibility necessary for greater sophistication. Presently the 
model is one-dimensional. This causes some difficulty in extrapolating 
results to a two-dimensional mobile radio system but the results 
have direct application to a system laid out along an expressway 
or possibly to air-ground telephony. It also serves as a valid frame- 
work within which to make comparisons of different channel assign- 
ment schemes. Call initiations and vehicle movements are assumed to 
be random processes which will be described in detail in Section ITI. 
The propagation path between base and mobile was assumed to be such 
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Fig. 3—Functions of a high-capacity mobile radio system. 
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that there is a smooth variation (1/R¢, for example) of signal strength 
as the vehicle moves with respect to the base station. This assump- 
tion is of course unrealistic in many cases, but it permits the study 
of channel assignment methods separately from the effects of random 
propagation. It is assumed that there exists a distance interval over 
which the same frequency may not be reused. For each mobile it is 
further assumed that at any given instant there is only one base sta- 
tion which will provide a signal greater than the required thermal 
noise threshold and simultaneously a signal-to-cochannel interference 
ratio greater than some acceptable value. A discussion of the simula- 
tion of the channel assignments and of the system updating is pre- 
sented in Sections IV and V. 


Ill. CALL INITIATION AND VEHICLE MOVEMENT IN THE SIMULATIONS 


Call initiation and vehicle movement in the simulation are handled 
in a single subroutine which is used to drive several different operating 
systems subroutines. The flow-chart in Fig. 4 illustrates the major 
sequences in this driver subroutine. For each system cycle, the sub- 
routine steps through each subscriber in sequence and checks his 
activity status. Depending on events which have occurred in previous 
system cycles, a subscriber will be either “on” in the system or “off.” 

If the subscriber is not “on” in the system, the right-hand branch 
of the chart is followed. The first step along this branch is to generate 
a random number which determines whether or not a call-attempt 
will be made. If the result is not to make an attempt, that fact is 
noted and the subroutine steps to the next subscriber. If the result is 
to attempt a call, then another random number is generated from a 
uniformly distributed set to determine the subscriber’s location in the 
one-dimensional universe of the simulation. This method of generating 
call-attempts produces call arrivals which are Poisson processes 
in time at each base station and are uniformly distributed in space. 
This uniform distribution in space can be weighted to produce any 
desired spatial distribution by using counters which reject certain 
attempts at some locations. A velocity is assigned to the subscriber 
for each call-attempt produced. This velocity is assigned from a 
random number set which has a prescribed density function. The 
simulation currently uses a truncated Gaussian velocity distribution. 
If the call-attempt is located near an “edge” of the one-dimensional 
space, it is checked against criteria used to compensate for the effects 
of the edges and simulate an infinite system. This compensation then 
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Fig. 4—Driver program. 
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cancels some attempts near the edges before they actually enter the 
system. 

The left-hand branch of Fig. 4, starting at the status block, indicates 
the procedure followed for subscribers that are already “on” in the 
system. The first check is whether or not the subscriber hangs up. 
Call terminations are determined in a way that permits specifying 
the density function for call durations. The simulation currently uses 
a truncated Gaussian call duration distribution. Subscribers termi- 
nating calls are accounted for in the system statistics and flagged for 
the system operating subroutine. A subscriber who does not hang up 
is moved according to his predetermined velocity and the length of 
time between system cycles. His new location is then passed on to the 
system operating subroutine. 

After a vehicle is moved, a random number is generated which 
determines if the velocity should be changed. If a velocity change is 
indicated, the new velocity is assigned at random following the same 
procedure used to determine vehicle velocities for call attempts. After 
vehicles are moved, those near the “edges” are again checked against 


DYNAMIC CHANNEL ASSIGNMENT 1839 


an edge compensation criterion and appropriate statistical values are 
accumulated. 


IV. A DYNAMIC CHANNEL-ASSIGNMENT OPERATING SYSTEM 


In the current simulation a single operating system subroutine de- 
termines the appropriate base station to serve a call-attempt, assigns 
a radio channel if one is available or refuses service otherwise, and 
updates. channel assignments as vehicles move about. The flowchart 
in Fig. 5 illustrates the major sequences in this operating system. For 
each system cycle, this subroutine steps through each subscriber who 
is either “on” in the system or is attempting a call and checks his 
activity status. 


4.1 New Call-Atiempts 


New call-attempts are processed along the center branch of the 
flowchart. The first step in processing an attempt is to determine 







INCREMENT 
SUBSCRIBERS 












CALL IN 
PROGRESS 

















NEW 
BASE STATION 
REQUIRED 


CLEAR HANG UP 
SYSTEM 

NEW CALL-ATTEMPT 

DETERMINE 


BASE 
STATION 
















DETERMINE 
NEW BASE 
STATION 









NEW 
CHANNEL 
REQUIRED 









CHANNEL 
SEARCH 













BLOCK CALL 
FORCE 
TERMINATION 


ASSIGN 
CHANNEL AND 
BASE STATION 







CHANNEL 
AVAILABLE 


ACCUMULATE 
STATISTICS 


Fig. 5—System control. 
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which base station (or stations) should be used to serve the call. 
Because of the simple propagation characteristics assumed so far in 
the simulation, the base station nearest to the subscriber has the 
strongest received signal from the subscriber and will be the only one 
with adequate signal strength to serve. It is the station assigned to 
serve the call-attempt. If the base station is near an “edge” of the 
one-dimensional space, additional checks of the calls “on” at the base 
station are made. Some call-attempts near the edges of the one- 
dimensional space are refused service at this point if the total calls 
“on” at the base station to be assigned exceed some value determined 
by the edge compensation criterion. 

The next step in processing a new call-attempt is to search for a 
radio channel which satisfies some channel assignment criteria. The 
simulation will permit many different assignment criteria to be investi- 
gated. Because of the simple propagation model assumed at this time, 
the current channel assignment criterion assumes that the base sta- 
tions separated by greater than a certain reuse interval will not inter- 
fere with each other when they use the same radio channel (i.e., there 
will be no cochannel interference). The current dynamic channel 
assignment criterion assumes that all radio channels are available at 
all base stations. It also assumes no limit on available radio equip- 
ment at the base stations. This last assumption is somewhat unrealistic 
but it will permit the determination of upper bounds on actual radio 
equipment requirements. These equipment limits will then be included 
in later simulations. Channel search is done channel by channel start- 
ing with Channel 1. If the channel being checked is in use at: the base 
station to be assigned, then that channel is rejected and the next 
channel is checked. If a channel is not in use at that base station, a 
check is made base station by base station on both sides of that base 
station. If the channel is in use at a base station closer than a reuse 
interval away from the base station to be assigned, the channel is 
rejected and the next channel is checked starting again at the begin- 
ning of the channel check procedure. If the channel is not in use at any 
base station less than a reuse interval from the base station to be 
assigned, then further checks on that channel are made and it is com- 
pared with other channels, if any, which also have met this reuse inter- 
val criterion. If, at any time during the channel searching, a channel 
is found which is not in use at any other base station in the system, 
that channel is flagged for assignment and the channel search is ter- 
minated. Otherwise, channels which meet the reuse criteria are inter- 
compared to find the “most desirable one.” The most desirable one 
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will depend on the system operating parameter which is to be mini- 
mized, but the optimum choice has not been determined yet. For this 
simulation, the channel flagged for assignment is the one whose next 
“in use” base station is closer to the base station to be assigned than 
any of the other channels satisfying the reuse criterion but is at least 
a reuse interval plus an integer number, J, of base stations away from 
the base station to be assigned. The use of J = 0 packs assignments 
as close as possible on one side of the base station to be assigned but 
increases forced terminations of calls when boundaries are crossed. 
The use of J = 1 relieves the forced terminations slightly but results 
in a slightly greater separation between base stations using the same 
channel. In any case, if the only channel satisfying the reuse criterion 
is ‘‘on” at a base station exactly one reuse interval from the base sta- 
tion to be assigned, that channel is still flagged for assignment. This 
channel search strategy never allows a channel to be assigned at base 
stations less then a reuse interval apart, but attempts to pack channel 
assignments close together for efficient channel usage. 

If, after checking all radio channels in the system, no available 
channel is found, the call-attempt is refused service (the call is 
blocked) as indicated on Fig. 5. Blocked call-attempts are completely 
removed from the system and no other attempt to serve them is made. 
On the other hand, if an available channel is found, it is assigned to 
the vehicle and the base station to be assigned and the call setup is 
complete. Statistics are accumulated and the next subscriber checked. 


4.2 Calls in Progress 

The processing of calls in progress can be followed by returning 
to the status check block at the top of Fig. 5 and following the right- 
hand branch. The first step along this branch checks the suitability 
of the current base station assignment. This is done by checking to 
see if the vehicle has moved (during its last pass through the driving 
subroutine) closer to some other base station than the one currently 
serving it (and thus has a stronger signal at the other base station). 
If it has not moved into such a situation, then its current base station 
and channel assignment are acceptable and no further action is taken 
except to account for the call in the system statistics. If, however, the 
vehicle has moved closer to another base station, that “now closer” 
station must be used to serve the call. As indicated on the flowchart, 
the assigned channel is then checked at the new base station to be 
used and at all base stations within a reuse interval of the new one. 
If the channel passes this check, that is, is not in use at those stations, 
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the new base station and old channel are assigned to serve the call and 
the old base station is cleared of the call. If the old channel is in use 
within a reuse interval of the new base station, then a whole new 
channel search is initiated identical to the one used for new call- 
attempts. In the case of a call in progress, the only difference is that, 
if a substitute channel is not available, the call is forced to terminate 
instead of being refused. Only one attempt is made to switch channels 
for such a call in progress. In either case, the old base station is 
cleared of the call, statistics are accumulated, and the subroutine 
steps to the next subscriber. 


4.3 Call Terminations 


The only actions required to process call terminations are to clear 
the channel at the assigned base station and accumulate the appro- 
priate statistics. These actions are indicated on the left-hand branch 
of Fig. 5. 

The dynamic channel-assignment operating system used in the simu- 
lation and just described follows a simple, straight-forward operating 
procedure. No doubt, more efficient channel usage could be achieved 
if the system were continuously checked call by call and channels 
changed on calls in progress to more optimally pack the calls in the 
system and free more channels for new calls. Similarly, more efficient 
initial channel assignments could be achieved if attempts were made 
to change channels of calls in progress to try to free a channel for 
serving a call attempt which would now be blocked. Fewer new calls 
would be blocked if new callers were given several attempts over a 
short time period to obtain a channel] rather than the one attempt 
allowed in the current simulation. 


V. A FIXED-CHANNEL ASSIGNMENT OPERATING SYSTEM 


A single subroutine simulates the fixed-channel assignment operating 
system. This subroutine, which is similar to the dynamic channel 
assignment subroutine, determines the appropriate base station to 
serve a call attempt, assigns a radio channel or blocks the attempt 
depending on the availability of a radio channel, and updates channel 
assignment as vehicles move about. Fig. 5 also illustrates the major 
sequences in this operating system but some of the block functions are 
different for this system. Again, for each system cycle, the subroutine 
steps through each subscriber and takes appropriate action. 
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5.1 New Call-Attempts 


Processing of new call-attempts for the fixed-channel system is 
identical to that for the dynamic system down to the channel search 
procedure. The base station assigned to serve a call-attempt is the 
one nearest to the subscriber as discussed for the dynamic system. 

In the fixed-channel system, a subset of the radio channels avail- 
able to the system is permanently reserved for each base station. The 
subsets are reused at base stations separated by the reuse interval 
described previously. The number of channel subsets is the same as 
the number of base stations in a reuse interval, J. Every Ith base 
station uses the same channel set. (See Fig. 2.) The number of chan- 
nels, Cs, in a subset is 


C, = C,/I 


where C; is the total number of channels available to the whole sys- 
tem. Thus, each base station has C, channels reserved for its exclusive 
use. Channel search, then, only involves searching through the C, 
channels of the subset reserved for the base station to be assigned. If a 
channel from this subset is not in use, then it is assigned to serve the 
call. If no channel from this subset is available, the call attempt is 
refused service (the call is blocked). 


5.2 Calls in Progress and Call Terminations 


Processing of these calls also proceeds as in the dynamic system. 
A check is made to see if the vehicle has moved so that a different 
base station is required to provide service. If a different base station is 
not required, no system action is necessary. If, however, a different 
base station is now required, that base station is determined and for 
this fixed-channel system the only recourse is another channel search. 
The channel assigned at the previous base station is not available at 
adjacent stations so the alternative, available to the dynamic system, 
of continuing its use is not possible. This alternative, indicated on 
Fig. 5, is thus not applicable to the fixed system. The channel search 
at the new base station to be assigned is identical to the one described 
for new call-attempts. In this case, if no vacant channel is found, 
the call is terminated. 

Subscribers who hang up are cleared from the system and statistics 
are accumulated as in the dynamic system. 
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VI. THE DRIVING PARAMETERS 


The operating systems were tested for a 24-base-station 40-radio- 
channel network where the base station separation was assumed to 
be 2 miles; the minimum reuse spacing was 8 miles. This simulation 
assumed 1000 subscribers and operated with a 10-second cycle time, 
that is, within 10 seconds (in simulated time) the status of each sub- 
scriber was examined. In a true system only subscribers making or 
attempting calls are of interest. For simulation purposes, however, it 
was found more convenient to gather statistics and create demand 
by cycling through all subscribers. Statistics were taken from only 
the central 14 base stations to avoid any effects from the edges of the 
finite system. 

The spatial location of call-attempts will vary with traffic intensity. 
To provide a reference system, Section 7.1 considers a uniform spatial 
distribution of call-attempts. In Section 7.2, an example of a non- 
uniform call-attempt distribution is simulated. 

Call-attempts occur at random points in time (i.e., call-attempts are 
a Poisson process), therefore, the time difference between call-attempts 
should have an exponential density function with its mean equal to 
its variance. A histogram of the time differences for a typical data 
run is plotted in Fig. 6 along with a theoretical exponential curve. 

The distribution of the duration of mobile telephone calls for a 
high-capacity system is unknown. There is, however, some minimum 
length of time which a mobile caller will hold a channel. For example, 
even a call where no one answers will create a demand on the mobile 
radio system of perhaps 380 seconds. The distribution of call durations 
for this simulation was assumed to be a truncated Gaussian with a 
minimum call of 30 seconds and a maximum call of 10 minutes. The 
mode call time was chosen to be 1-1/2 minutes with a standard de- 
viation of 1 minute. The mean for this assumed distribution is 103.5 
seconds. Figure 7 shows both a theoretical curve of the density func- 
tion and a histogram of call lengths taken from a 1000-cycle data 
run. 

The distribution of vehicle velocity was also chosen to be a truncated 
Gaussian. In this example, the maximum speed was 60 miles per hour; 
the distribution had a zero mean and a variance of 30 miles per hour. 
A theoretical velocity distribution and a histogram from the simula- 
tion are plotted in Fig. 8. 
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Fig. 6—Time difference between call-attempts histogram. 


VII. EXAMPLES OF SYSTEM PERFORMANCE 


7.1 Uniform Call-Attempts 

The performance of a mobile radio system is measured knowing 
the interrelations between such parameters as the call-attempt rate, 
percent of call-attempts blocked, number of calls being handled, num- 
ber of vehicles crossing cell boundaries, and number of calls being 
forced to terminate. 

The location of call-attempts for this part of the simulation was 
chosen to be a flat distribution in space with appropriate edge tapering. 
Figure 9 shows the number of call-attempts at each base station for 
two typical data runs. Since it was desirable to take averages over 
this ensemble of 14 base stations the number of system cycles was 
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Fig. 9—Average call-attempt rate at 14 base stations for two attempt rates. 


increased until the deviation from the mean number of call-attempts 
by an individual base station was small. 

As an aid in understanding the details of the system operation, a 
special program was written to view at regular instants of time the 
status of the mobile radio system. Specifically, a computer printout 
was generated which indicates the locations where radio channels are 
being used and prints a symbol unique to that channel. The appro- 
priate symbol for each of the 40 two-way channels is shown in Table 
I. In addition, two other special symbols are used. The $ is printed if, 
during the time interval under consideration (usually taken to be 10 
seconds), a request for service was initiated but no channel was avail- 
able and service was refused. The $ is printed at the location where 
the call was attempted. An * is printed when a vehicle crosses a cell 
boundary and the user is forced to terminate his call prematurely 
because a channel is no longer available for him. 

Figure 10 illustrates the dynamic channel assignment system, 
DYNSYS, activity for 500 seconds for base stations 11 through 16. In 
the example, about 10 percent of the call-attempts are being blocked. 
The circled track 1 shows a typical call within the coverage area of 
base station 13, where the caller is assigned channel 39, designated by 
an open parenthesis. After 80 seconds a change in velocity occurred 
and the call terminated after 140 seconds. Recalling 2-mile base 
station spacing and 10-second time intervals, the velocity can be 
roughly calculated from the figure. In this example, the initial velocity 
was about 45 miles per hour and the final velocity was about 30 miles 
per hour in the opposite direction. Track 2 in this same figure shows a 
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TABLE J—CHANNEL IDENTIFICATION 


Channel Number Symbol 
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vehicle with channel C crossing a boundary and maintaining the same 
channel. That is, the channel assigned to the call “floats” with the 
vehicle. When the system activity is quite high, not all callers crossing 
boundaries can keep the same channel. Track 3 shows a vehicle, 
originally assigned channel G, reassigned to channel § as it crosses 
the boundary between base stations 11 and 12. Note, the channel 
change was required because channel G was in use (circle 3’) at base 
station 15 which is less than a reuse interval from base station 12. 
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Fig. 10—Examples of dynamic channel assignment system operation: 
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Fig. 11—Examples of channel reuse with dynamic channel assignment 
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Fig. 12—Examples of channel reuse with fixed-channel assignment 
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Track 4 shows a forced termination (of channel A) occurring at the 
boundary between base stations 12 and 13 because the channel is in 
use at base station 16 (circle 4’), and no other channels are available. 

In the pyNnsys, any channel is available at any base station. In the 
fixed-channel assignment system, FrxSys, with a reuse interval of 4, 
the 40 available channels were apportioned as shown in Table II. In 
the Frxsys, vehicles are not allowed to keep frequencies when crossing 
boundaries. Therefore, a track corresponding to track 2 in Fig. 10 is 
not possible. 

Channel reuse for the two types of systems is illustrated in Figs. 
11 and 12. In the pynsys, calls are served on a first come first serve 
basis regardless of location. This system must hunt for an available 
channel which is not being used within a reuse interval on either side 
of the new caller. The highlighted tracks of Fig. 11 illustrate system 
usage of channels X and H during a time period of 4 hour. The 
FIXSYs usage of channels H and N is illustrated in Fig. 12. 

As the systems were running, data were taken on operating param- 
eters such as the number of call-attempts, hang-ups, boundary cross- 
ings, etc., which occurred. These performance statistics were accumu- 
lated over runs of 1000 system cycles. Blocking is expressed as the 
ratio of call-attempts not finding a channel to the total call-attempts 
_ for a given time period. Forced terminations, which occur when a 
vehicle crosses a radio coverage boundary and cannot find an avail- 
able channel, are expressed as the ratio of calls forced off to calls 
served for a given time period. Calls-keeping-channel are expressed 
as the ratio of calls keeping the same channel when crossing a coverage 
boundary to the total number of boundary crossings for a given time 
period. This parameter indicates the degree to which channel coverage 
“floats” with vehicles. 

Figures 13 and 14 characterize the number of calls “on,” blocking, 
forced terminations, and calls retaining their channel as a function 


TaBLE II—CHANNEL ALLOCATION FOR FIxED-CHANNEL 
ASSIGNMENT SYSTEM 


Base Station Channels Available 


, 9, 13, 17, 21 1-10 
, 10, 14, 18, 22 11-20 
11, 15, 19, 23, 21-30 
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Fig. 13—Performance of a dynamic channel assignment system (flat loading). 


of the call-attempt rate for the pynsys and Fixsys. The values are 
averages per base station and the spatial call-attempt distribution is 
flat over all base stations. 

The channel assignment philosophy currently used in pynsys selects 
as first choice a channel being used five base stations (a reuse interval 
plus one base station) on one side or the other of the new attempt 
location. In Fig. 13 the solid points are for two attempt rates where 
the first choice for channel assignment is a channel which is being 
used only four base station intervals (one reuse interval) away. As 
expected, the calls “on” were increased, the blockage was lowered 
slightly, forced terminations increased, and fewer vehicles kept the 
same channel after crossing a boundary. 

The calls-keeping-channel curve shows that at low loading most 
vehicles are able to maintain their channels as cell boundaries are 
crossed while at very high loading, channel usage is packed more 
nearly at reuse intervals apart and fewer vehicles can maintain their 
originally assigned channel. Even at high loading, however, about 
half of the vehicles keep their channels as boundaries are crossed. 
Note, this parameter is the one most affected by the different assign- 
ment philosophies (preferred 4- or 5-station separation). 

Figure 14 presents similar performance curves for the rrxsys. Since 
channels must always be switched at boundary crossings the number 
of forced terminations is increased over the pynsys. When no motion 
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is allowed and call durations are assumed to be exponentially dis- 
tributed, it is possible to calculate (given the average call duration) 
from trunking formulas a theoretical expression for the blocking 
probability versus the number of calls “on” at a base station. The 
FIXSYS simulation, which allows motion and assumes truncated Gaus- 
sian call time distributions, agrees almost exactly with the trunking 
formula curves (Calls On and Calls Blocked) in Fig. 14 and Fig. 
15 (dashed curve). Under the current operating conditions, motion 
and call time distributions have negligible effect on these two 
parameters. 

The Dynsys performance is compared to the FrxSys performance 
in Fig. 15. In the low blocking region (below 10 percent) which is 
where a radio system should normally operate, the pynsys handles 
more calls than the rixsys. It provides about one extra call per base 
station (20 percent increase) at 3 percent blocking. At high blocking 
rates, the FIXsys operates more efficiently; the reason is that this sys- 
tem can achieve a maximum number of calls “on” when there is 
always a call waiting at each base station. The pynsys could, of course, 
be programmed to switch over to this mode of operation as the system 
saturates. 

As shown in Fig. 9, call-attempts were chosen to be a flat distribu- 
tion in space. The corresponding reaction to the call-attempts is the 
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Fig. 14—Performance of a fixed-channel assignment system (flat loading). 
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Fig. 15—Channel usage (flat loading). 


average-calls-on distribution which is plotted in Fig. 16 as a function 
of base station for two demand rates for the DyNsys. 

The pyNnsys simulation allowed for the possibility that all channels 
could be used simultaneously at any base station. A histogram of the 
actual number of calls “on” for two different attempt rates is shown 
in Fig. 17a and b. It is immediately obvious that for a flat spatial 
distribution of call-attempts, halving the number of radio transmitters 
would not affect the system performance, since the number of channels 
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Fig. 16—Average number of calls “on” for two attempt rates. 
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Fig. 17a—Histogram of channel usage per base station for dynamic channel 
assignment system with a calls-on average of 7.1. 


Fig. 17b—Histogram of channel usage per base station for dynamic channel 
assignment system with a calls-on average of 8.66. 


used at a base station seldom exceeds that value. The design of the 
FIxsys allows for a maximum of 10 transmitter channels at any base 
station. Similar histograms of the calls-on distribution for the FIxsys 
are presented in Fig. 18a and b. 


7.2 Nonuniform Call-Attempts 

The data presented to this point were produced under the assump- 
tion that the spatial distribution of call-attempts is uniform. It is 
reasonable to expect that variations in call-attempt rates might occur 
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Fig. 18a—Histogram of channel usage per base station for fixed-channel assign- 
ment system with a calls-on average of 5.1. 


Fig. 18b—Histogram of channel usage per base station for fixed-channel assign- 
ment with a calls-on average of 7.08. 


during rush hours and there certainly will exist random occurrences 
of highly localized demands. 

The following example is presented as a plausible situation to 
demonstrate the reactions of the two systems. Assume that the call- 
attempt density within the coverage area of two adjacent base stations 
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is four times higher than the other base stations in the system as illus- 
trated below: 


4 — e e 


Attempt Rate 
(Normalized) 


Base Stations 


Such a situation could occur when traffic converges at the center of a 
city during the morning rush hours. 

One can calculate the blocking versus calls-on for the Frxsys using 
the trunking formulas referenced earlier. The comparative perform- 
ance of the DyNSys was obtained by simulation. The values plotted 
in Fig. 19 are averages taken over one reuse interval (four base sta- 
tions) centered on the two base stations with higher call-attempt 
rate. Since the same 1,000 system cycles were used in the simulation 
and the averages could not be taken over all base stations, the ac- 
curacy of the pyNsys points is degraded compared to those presented 
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Fig. 19—Channel usage (peaked loading). 


1856 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


in the flat loading examples in Fig. 15. However, since there is such 
a marked difference in performance between the two systems more 
precise calculations of the points were not felt necessary. In this ex- 
ample, we see that the rrxsys can handle only 40 percent of its average 
capacity of 10 calls “on” per base station, while maintaining 10 
percent blockage, whereas the pyNsys can work up to 70 percent 
capacity at the same 10-percent blockage level. 


VIIT. SUMMARY 


The simulations described readily produce system performance 
parameters which are difficult, if not impossible, to obtain analytically. 
The system driving routine provides flexibility in parameter variation 
such as in demand profiles and distributions of call durations. Many 
channel assignment philosophies and system configurations can be 
readily explored by the versatile system operating routines. The 
graphical techniques used to display system operation for each system 
cycle provide a vivid picture of system dynamics. 

The simulated points for uniform spatial distributions of attempts 
were averaged over a time sufficient to minimize statistical fluctua- 
tions. This is illustrated by the close fit of the points to the smooth 
curves. The fact that the simulated data for the fixed-channel assign- 
ment system agrees closely with calculations based on telephone 
trunking formulas lends credibility to the simulations and to the use 
of these formulas for determining the performance of such systems. 

Portions of the system control routines such as the channel assign- 
ment procedures and the method of coding vehicle channel and base 
station assignments could be used essentially without modification for 
controlling an actual high-capacity mobile radio system. 

When operating with a uniform spatial distribution of attempts and 
with the fixed-channel assignment system optimally matched to this 
distribution, the dynamic channel assignment system out-performs 
the fixed system for blocking rates less than 10 percent. In fact, at 
a blocking rate of 3 percent, the dynamic system handles 20 percent 
more calls per base station than the fixed system. 
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Antenna Spacing Requirement for a 
Mobile Radio Base-Station Diversity 


By W. C. -Y. LEE 
(Manuscript received February 8, 1971) 


The minimum required antenna spacing between two base-station 
antennas in order to take advantage of spatial diversity technique 
was investigated. The measurements were made for two cases: (1) 
the incoming radio signal was perpendicular to the axis of two base- 
station antennas (the broadside case), and (it) the signal was in-line 
with the axis of two base-station antennas (the in-line case). The 
correlation of signals received from two separated antennas at the 
base station was found to be much higher for the in-line case than for 
the broadside case with any given antenna spacing. For correlation up 
to 0.7, from which most of the advantage of two-branch diversity can 
still be obtained, we found the minimum required antenna spacing is 
around 70-80 for the in-line case and 15-20d for the broadside 
case. In order to achieve a correlation always less than 0.7 between 
two base-station signals regardless of the arrival direction of the 
incoming signal, a triangular configuration with a three-antenna array 
used with a three-branch diversity receiver is proposed, requiring less 
antenna spacing in the array than for a two-antenna. setup. 


I. INTRODUCTION 


It has been shown previously that diversity reception techniques at 
the mobile unit often help to reduce the fading rate of a mobile radio 
signal.-* Here we try to determine the limitations on using space 
diversity at the base station in order to reduce the signal fading. In 
particular, we would like to know how large the antenna spacing 
should be between two base-station antennas in order to take ad- 
vantage of the diversity technique. If this turns out to be practical, 
then we might prefer to build a diversity system, even a complicated 
one, from the economical point of view at the base station, and let 
the transmitter and receiver in the mobile unit be as simple as pos- 
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sible.*> In the experiments we varied the antenna spacing between 
two base-station antennas and calculated the cross-correlation of the 
envelopes of the signals received from these antennas for a number of 
different antenna spacings. We also determined the theoretical relation- 
ship between the cross-correlation and the cumulative distribution 
curve of the combined signal. If we select a particular cumulative dis- 
tribution curve as acceptable for diversity operation, the corresponding 
correlation then indicates the required antenna spacing. 

Using the experimental correlation data of both the in-line propa- 
gation case and the broadside propagation case as a guide, we will try 
to reduce the required antenna spacings by using an array of three 
base-station antennas. 


II, EXPERIMENTS 


A mobile radio transmitter was set up at 886 MHz in a station wagon 
with a \/2 dipole mounted vertically. This transmitted to two receiving 
horns having 24 degrees beam width in the horizontal plane and located 
at the north end of Crawford Hill in Holmdel, New Jersey. During the 
measurements the station wagon was driven at a constant speed of 15 
mph on three selected streets in the Keyport area, as shown in Fig. la 
and b. The horn antennas were used to reduce the local scattering at 
the base, thus simulating the conditions at a typical installation where 
the basic antenna is mounted well above nearby objects. Two of the 
streets chosen were in line with the radius vector of the base station, 
and the other was perpendicular to the radius vector. (The symbols e— 
and <-e in Fig. la and b indicate different runs on the same streets. 
The dot indicates that the data have been received around this point on 
the street, and the arrow indicates the motion of the transmitter.) 
Figure la shows the experimental set up for the case where the incoming 
radio signal is perpendicular to the axis of two base-station antennas 
(the broadside case). In this case, the S — N runs were closer to the base 
station than the N —S runs on Main Street and on Broadway, as shown 
in Fig. la. The distance from the mobile transmitter to the base- 
station receivers was about three miles. The separation between the two 
base-station receiving antennas was variable from 25) to 70\ in 10A 
steps starting from 380A, 1.e., 25d, 30, 40, --: . 

Figure 1b shows the experimental setup for the case where the in- 
coming radio signal is parallel to the axis of two base-station antennas 
(the in-line case). In this case, the S ~ N runs and N — §S runs were 
made approximately in the same sections of the streets. The distance 
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Fig. la—Map of testing area (Keyport, N. J.), the broadside case. 
Fig. 1b—Map of testing area (Keyport, N. J.), the in-line case. 
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from mobile transmitter to the base-station receivers was also about 
three miles. The separation between the two base-station receiving 
antennas was variable from 30A to 100 in 10 steps. The height of 
the front base-station antenna was fixed at 12 feet. The height of the 
rear antenna was variable in three steps, 16 feet, 12 feet, and 8 feet. 
For both cases, each signal received from each antenna was fed to 
a separate receiver, which was calibrated from a common source. The 
two signals were recorded simultaneously on a magnetic tape recorder. 


Ill. EXPERIMENTAL CROSS-CORRELATIONS 


All data obtained from the two-channel base-station receiver were 
digitized at a 500-Hz sampling rate, which is fast enough to sample 
the data adequately. There are several questions which need to be 
answered before calculating the cross-correlation of two received sig- 
nals. First, are the mean values of a signal in different time intervals 
(local means) different, and, if so, how should we compensate for this? 
Second, how many sample points should be taken for each of two 
received signals to calculate their cross-correlation? Third, are the 
processes of the signals we received stationary in the wide sense?® 
These can tell us that the correlations we have obtained are inde- 
pendent of time. Statistically analyzing our data, we have had to 
justify several statistical properties of individual pieces of data before 
calculating the cross-correlations among them: 


(1) Treatment of Local Means—We have found that the mean values 
of a signal in different time intervals (local means) are different. The 
variations in the local means of a received signal are due to the dif- 
ferent contours of land between the transmitter and the receiver at 
different time intervals. These variations affect the auto-correlation 
function of a received signal. Hence, we normalized the local means 
of the received signal, before calculating its auto-correlation, as fol- 
lows: We broke a piece of data into many time intervals, then each 
sample was divided by the mean of the signal in that time interval. In 
our data, we used a time interval a half-second long, 1.e., 250 samples. 

(it) The Number of Sample Points to be Taken-We have found 
from the auto-correlation curves of most data that their first nulls 
occur at a delay, 7, of 20 samples. Then we may say that there is no 
correlation between two pieces of data separated by a delay of 20 
samples. Since these pieces of data are Rayleigh distributed,‘ if they 
are uncorrelated, they are also independent. Now we look at this a 
different way: how long will a piece of data be which can from an 
ensemble of , say, 100 independent subsets of data? Since every subset 
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of data with a delay of at least 20 samples from each other is an 
independent subset, a piece of data containing over 2000 samples is 
required to provide an ensemble of 100 independent subsets. We know 
that 100 independent subsets of data are enough to calculate the 
ensemble average. Therefore, from the concept of ergodicity, we need 
a piece of data containing over 2000 samples in order to meet the 
requirement “time average equals ensemble average”. However, most 
of the local means of all runs vary rapidly after 2500 samples. There- 
fore, we chose 2500 sample points as a unit piece of data. 

(71) Verrfication of Stationarity-If a piece of data is stationary, 
the auto-correlation of a piece of data x(t) should be independent 
of time ¢. This means that if we pick any arbitrary starting point, say 
one second after the first sample point, and calculate the auto- 
correlation, the result should be the same as if we had chosen the first 
sample point as the starting point. This evidence was observed in 
our data. Hence we have shown that our data are stationary. 


In summary, we have found that the local means of two signals 
should be properly factored out, and that 2500 sample points are 
sufficient for calculating the cross-correlation. Under these two require- 
ments, the processes of the signals we received were verified to be 
stationary in the wide sense. 

Assuming two signals received from two base-station antennas, 
separated by distance d, are x(t) and y(t), the cross-correlation co- 
efficient of x(t) and y(t) delayed by time 7 can be expressed as 


“ue +) — mam, 
Vii) — mV y(t + 7) — m 





p(t, T; d) = (1) 





where m, and m, are mean values of x(t) and y(t + 7), respectively. 
Since we have shown that our data are stationary, and also we let 
t = 0, equation (1) becomes 


= QU = mam, 

@) = == —— 
V220) — mV yO — mi 

After normalizing the local means of the two signals from the two 


channels (receivers), we used 2500 sample points and calculated the 
cross-correlation as a function of antenna spacing for two cases. 





(2) 








3.1 The Broadside Case 
The cross-correlation as a function of base-station antenna spacing 
was determined for runs on three streets (Main Street on Fig. 2, Maple 
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Fig. 2—The correlation coefficient of two base-station antennas broadside with 
the incoming wave vs antenna spacing for runs on Main St., Keyport, N. J. 


Place on Fig. 3, and Broadway on Fig. 4). From these three figures, 
there are several major points to be disclosed: 


(7) The cross-correlation decreases as the base-station antenna 
spacing increases, as one would expect. 

(it) In general, the cross-correlation is somewhat higher when the 
mobile transmitter is nearer the base station, such as the S > N runs 
shown in Figs, 2 and 4. 

(27) The tops of the trees at the northwest boundary of Crawford 
Hill were high and thus partially scattered the incoming signal from 
the northwest direction. Hence, the signals received from Broadway 
and the west side of Maple Place were lower. The cross-correlations 
obtained from these locations were also low, due to this local scat- 
tering. 

(tv) To measure the cross-correlation for an antenna spacing of 70), 
we moved the west-side receiving antenna farther left by 10\. There- 
fore, the tops of trees more affected the incoming signal received by 
Channel 1 from Broadway. Hence, the two points for this antenna 
spacing shown on Fig. 4 have very low values. 

(v) The highest cross-correlation for the six runs for an antenna 
spacing of 25\ was 0.65, and for an antenna spacing of 70 was 
about 0.27. 
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Fig. 3—The correlation coefficient. of two base-station antennas broadside with 
the incoming wave vs antenna spacing for runs on Maple Place, Keyport, N. J. 
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Fig. 4—The correlation coefficient of two base-station antennas broadside with 
the incoming wave vs antenna spacing for runs on Broadway, Keyport, N. J. 
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3.2 The In-Line Case 

The cross-correlation as a function of base-station antenna spacing 
was determined for runs on three streets (Main Street, Maple Place, 
and Broadway) for three values of base-station antenna height (see 
Fig. 1b), and the results shown in Figs. 5 through 138. Each figure 
shows the correlation for a particular antenna height for runs on a 
particular street. Assuming that the correlation at zero antenna spacing 
is one, we can draw a curve for the best fit to our measured correla- 
tion points for spacing from 0 to 100A for each run. From these nine 
figures, there are several major points to be disclosed: 


(7) Between the broadside case and the in-line case: The correlation 
coefficient of the in-line propagation case is much higher than that of 
the broadside propagation case for a given antenna spacing. For 
antenna spacing = 70, the correlation coefficient is about 0.2 for the 
broadside propagation case and about 0.7 for the in-line propagation 
case. Alternatively, to get a correlation of 0.7, the antenna spacing 
merely needs to be 25d in the broadside case but 70A in the in-line case. 

(2) Direction of runs: From Figs. 5-7 (Main Street) and Figs. 
11-13 (Broadway), we find that the correlations are higher for those 
runs (S — N) when the mobile radio transmitter was traveling away 
from the base station than those (N — 8S) when traveling toward 
the base station. Figures 8-10 (Maple Place), for runs perpendicular to 
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Fig. 5—The correlation coefficient of two base-station antennas (setup no. 1) 
in ad with the incoming wave vs antenna spacing for runs on Main St., Key- 
port, N. J. 
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Fig. 6—The correlation coefficient of two base-station antennas (setup no. 2) 
in line with the incoming wave vs antenna spacing for runs on Main St., Key- 
port, N. J. 


the radius vector of propagation, show that the signal correlations 
from runs in different directions, E > W or W > E, are not noticeably 
different. 

(i) Antenna height: The variation of height of the base-station 
antennas apparently has minor effects on the correlations of the 
received signals. (Figs. 5-13). 
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Fig. 7—The correlation coefficient of two base-station antennas (setup no. 3) 
in line with the incoming wave vs antenna spacing for runs on Main St., Key- 
port, N. J. ; 1 
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Fig. 8—The correlation coefficient of two base-station antennas (setup no. 1) 
in line with the incoming wave vs antenna spacing for runs on Maple Place, 
Keyport, N.J. 


(wv) Different streets: On Main Street the buildings are taller 
and spaced closer together than on the other two streets. The build- 
ings on Maple Place are again taller and spaced closer than those on 
Broadway. This may be the reason that the average correlation drops 
slightly faster on Main Street and more roy on Broadway as uae 
antenna spacing increases. 

(v) Local scattering at the base station: Most of the correlation 


1.0 


oe E-W RUN 


0.6 
ANTENNA SETUP NO. 2 


<q <— INCOMING WAVE 


044 al le 


BASE STATION 


CORRELATION COEFFICIENT 


0.2 





0 10 20 30 40 50 60 70 - 80 90 100 
: d (ANTENNA SPACING IN WAVELENGTH) 


Fig. 9—The correlation coefficient of two base-station antennas (setup no. 2) 
in line with the incoming Wave vs antenna spacing for - runs on Maple Place, 
Keyport, N. J. 
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Fig. 10—The correlation coefficient of two base-station antennas (setup no. 3) 


in line with the incoming wave vs antenna spacing for runs on Maple Place, 
Keyport, N. J. 


curves in Figs. 5-13, are monotonically decreasing as the antenna 
spacing increases. A few curves have dips; for example, see Fig: 12. 
These are perhaps caused by the local scattering at the base station. 

(vi) Upper bound of the correlation data: Figure 14 shows the 
upper bound of the correlation data for the three antenna setups 
versus the antenna spacing d. When d = 70, the highest correlation 
‘is 0.85. 
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Fig. 11—The. correlation coefficient of two base-station antennas (setup no. 1) 
in line with the incoming wave vs antenna spacing for runs on Broadway, 
Keyport, N. J. 
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Fig. 12—The ‘correlation coefficient. of two base-station antennas (setup no. 2) 
in line with the incoming wave vs antenna spacing for runs on Broadway, 
Keyport, N. J. 


IV. EFFECT OF CORRELATION ON DIVERSITY 


Having obtained the cross-correlation as a function of antenna 
spacing, we would now like to ask how great a cross-correlation co- 
efficient between two received signals we can tolerate and still realize a 
signal improvement in a two-branch predetection diversity receiver. 
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Fig. 13—The correlation coefficient of two base-station antennas (setup no. 3) 
in line with the incoming wave vs antenna spacing for runs on Broadway, 
Keyport, N. J. 
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Fig. 14—Upper bound of correlation coefficients from different antenna setups 
at the base station (in-line case) with a mobile radio transmitting from three 
different streets in Keyport, N. J. 


To do this we will find the cumulative distribution of a two-branch 
diversity combined signal as a function of the cross-correlation of the 
two incoming signals. 

Assume that the two signals received by a two-branch diversity 
receiver are complex gaussian random variables* in a short-term 
fading (a piece of data a few hundred wavelengths long). If these two 
signals are independent, i.e., no cross-correlation, the cumulative dis- 
tribution of a two-branch maximum ratio diversity* signal can be 
expressed as’ 


Py <x) =1- (1 ae 2) exp (-2) (3) 


where y is the sum of the instantaneous SNR’s on the individual 
branches, I is the average SNR of a single channel, and z is a value 
greater than y. For the case that the two received signals are not 
independent, we can obtain the cumulative distribution from Ref. 9 
by inserting equation (10-10-21) into equation (10-10-22) for a two- 
branch maximum-ratio diversity. After some manipulation, we get 


* The cumulative distributions of a two-branch maximum-ratio diversity com- 
bining signal and a two-branch equal-gain diversity combining signal are very 
similar, differing only by 0.49 dB8 if the two branches are independent. 
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Py <2) = 1 


7 f + Vp) exp |-al es — Vp) exp a= You) 
2V p 
(4) 


where y, «, and T are as previously described and p is the cross- 
correlation coefficient between the envelopes of two-branch signals, 
Equations (38) and (4) are plotted in Fig. 15. The abscissa is the SNR 
with respect to the mean value of the SNR of a single channel, in dB. 
We see that the curve for p = 0.7 is very close to that for p = 0. 
This means that for correlations between the two branches up to 0.7, 
the advantage of using diversity technique is still good. Figure 15 also 
shows that 99.9 percent of the time the SNR for p = 0 is above —13.8 
dB, while at the same percentage, the SNR for p = 0.7 is above —15.8 
dB. The difference is 2.6 dB. Comparing the two-branch signal for 
p = 0 with a single-channel signal at the 99.9 percent level, the dif- 
ference is 17 dB. For a probability of 99.9 percent, the difference be- 
tween two two-branch signals for p = 0 and p = 0.7 is still 2.5 dB; 
however, the difference between a two-branch signal with p = 0 and 
a single-channel signal is 22 dB. Hence, we could say that, if a two- 
branch signal for p = 0.7 is taken, the improvement over a single 
channel at the 99.99 percent level is greater than at the 99.9 percent 
level. Figure 16 gives a clear view of the two-branch diversity im- 
provement over a single-channel signal for different correlations. The 
improvement becomes less as the correlation increases. Also the 
improvement becomes greater at the higher percentage signal level 
than at the lower percentage signal level, as mentioned previously. 
For p between 0 and 0.7, the diversity advantage changes very little. 
Hence, we may say that p = 0.7 is a reasonable value to pick for the 
maximum cross-correlation which can be tolerated in order to take 
good advantage of the diversity technique. The antenna spacing for 
a cross-correlation of 0.7 is around 15-25) for the broadside case (see 
Figs. 2-4) and is 70-80. for the in-line case (see Figs. 5-13). 


V. CONSIDERATION OF A THREE-BRANCH DIVERSITY SYSTEM 


As we have seen, to get a correlation of 0.7 between two base- 
station signals in the in-line case requires antenna spacings of 70A to 
80A, and in the broadside case requires antenna spacings of 15, to 
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Fig. 15—The cumulative distribution curves vs correlation for a two-branch 
maximum-ratio diversity receiver. 


25. Hence the broadside case is better than the in-line case as far 
as saving the antenna space is concerned. Since the base-station 
antennas are set up not only for one mobile unit in one particular 
direction but rather for all the mobile units under its radio coverage, 
some mobile units may well be in the in-line case to the base station 
if only two base-station antennas are considered. If we try to reduce 
the antenna spacing and still meet the same correlation requirement 
of 0.7 or less regardless of the direction of the incoming signal arrival, 
a triangular antenna array may provide a solution. In a triangular 


1874 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


2 BRANCH 


PERCENT PROBABILITY THAT AMPLITUDE >ABSCISSA 





ce] 10 20 30 
IMPROVEMENT OF SNR OF TWO-BRANCH 
OVER A SINGLE CHANNEL IN DECIBELS 


Fig. 16—Improvement of signal-to-noise ratio of a two-branch signal over a 
single-channel signal. 


array, we have three cross-correlations between three antennas. When 
an incoming mobile radio signal is in line with two antennas of the 
array, then the correlation with these two antennas is very high but 
the other two correlations are lower, since they are more nearly broad- 
side to the incoming signals. Because two of the three antennas are 
always approximately broadside to an incoming signal in a triangular 
array, we may always get at least the advantage of two-branch di- 
versity but never more than three-branch.’° Furthermore, by making 
more correlation measurements for arrival angles between in-line and 
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broadside cases in the future, the exact performance of the triangular 
array could be calculated. 


VI. CONCLUSION 


The measurements reported here show that the correlation of the 
in-line propagation case is much higher than that of the broadside 
propagation case for any given antenna spacing. To get a correlation 
of 0.7, the antenna spacing merely needs to be 25d in the broadside 
case but 70A-80A in the in-line case. Direction of runs and variation of 
height of the base-station antennas have minor or insignificant effects 
on the correlation. Local scattering at the base station may not have 
been entirely eliminated in our test; this, if present, would reduce the 
apparent correlations. In this case, larger separations might be required 
in actual situations, depending on the amount of local scattering that 
existed. Further work is needed to resolve this point. 

The theoretical analysis has pointed out that the advantage of a 
two-branch diversity can be obtained when the cross-correlation be- 
tween branches is less than 0.7. 

In order to achieve a condition on the cumulative distribution curve 
equivalent to a correlation always less than 0.7 between two base-sta- 
tion antennas regardless of the direction of the incoming signal arrival, 
yet with antenna spacing less than the in-line case, a triangular con- 
figuration with a three-antenna array tsed with a three-branch diver- 
sity receiver is proposed. We estimate that, in a triangular array, the 
antenna spacing between any two of three antennas could be around 
40 for its cumulative curve to be better than that of a two-branch 
receiver with correlation of 0.7 regardless of the direction of the incom- 
ing signal onto the triangular array. The idea of applying the diversity 
scheme at the base station could then be realized. 
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A Simple Interframe Coder For 
Video Telephony 


By J. O. LIMB and R. F. W. PEASE 


(Manuscript received February 9, 1971) 


The technique of exchanging resolution according to the amount of 
movement in a picture has been previously described; in stationary 
parts of the picture the temporal resolution 1s reduced while in moving 
parts of the picture the spatial resolution is reduced. Here, we describe 
a method of applying resolution exchange to a differentially quantized 
(DPCM) signal. The resulting channel capacity required for the sub- 
jectwely satisfactory transmission of the differential signal ts halved. 
The coder is simpler than most interframe coders and should not 
imcrease the sensitivity of the system to channel errors. 


I. INTRODUCTION 


In a previous paper we described a way to halve the channel 
capacity required for the subjectively satisfactory transmission of an 
8-bit PCM television signal by exchanging spatial and temporal reso- 
lution according to the amount of movement in the local part of the 
picture: Every second picture element (“pel”) is sampled and in 
stationary areas of the picture the values of the unsampled pels are 
interpolated from adjacent temporal samples (reduced temporal reso- 
lution) ; in the moving areas the values of the unsampled elements 
are interpolated from neighboring sampled elements in the same line 
(reduced spatial resolution). 

We would like to apply this technique to a signal whose bit rate has 
already been reduced by an element-to-element differential quantizer 
(EDQ), eg., the Picturephone® codec. Unfortunately, halving the 
horizontal sampling rate, as in Ref. 1, increases the amplitude of the 
sample-to-sample difference signal which, in turn, requires a larger 
number of quantizing levels for adequate representation. There are 
two ways around this problem: 
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(¢) Use the vertically adjacent elements as a prediction of the current 
pel and quantize the resulting difference. Because vertically adjacent 
pels are in the previous field, such a technique can be called field 
difference quantization and is the subject of another study. 

(ii) Reduce the vertical resolution rather than the horizontal resolu- 
tion so that the full horizontal sampling frequency is retained; 
every transmitted line is left completely intact. This is the method 
described here. 


II. PRINCIPLE AND IMPLEMENTATION 


In Fig. 1 we show an outline of a system which uses resolution 
exchange in conjunction with differential quantization of the signal. 
The output of the television camera is differentially quantized and 
fed to a movement detector which usually consists of a frame delay 
circuit, a difference and threshold circuit, and associated logic. When 
movement is not detected, that part of the picture being coded enjoys 
the full spatial sampling frequency but each line is only transmitted 
every second frame, i.¢e., the temporal sampling frequency is halved, 
When movement is detected, then the vertical sampling frequency is 
halved by transmitting only every second line of the picture. For a 
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Fig. 1—Schematic of communication system using simple interframe coder. 
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2:1 interlaced scan this means transmitting every other field and so the 
temporal sampling frequency is raised to 30 Hz. 

At the receiver, when movement is not detected, alternate (sampled) 
lines in each field are decoded and displayed. In place of the un- 
sampled lines in the field a temporal average of corresponding lines 
from neighboring sampled frames is formed and displayed.* 

To describe this “stationary mode” we assign to each line co- 
ordinates (y, t) which refer respectively to the vertical position in 
each frame (or line number) and the temporal position (or field num- 
ber). Figure 2 shows diagrams of lines for a 2:1 interlaced scan 
format. Each sampled line is marked with a full dot and the unsampled 
lines are shown as circles; the averaging is denoted with arrows. In 
the stationary mode we receive and display alternate lines in each 
field, i.e., lines 2, 3, 6, 7, 10, 11, -++ in fields 2 and 3 and lines 0, 1, 4, 5, 
8, 9, --- in fields 4 and 5. The averaging is done only along the time 
(t) axis and so stationary pictures are displayed with the full resolu- 
tion of the fully sampled picture. 

The results of preliminary experiments with this mode of operation 
continually applied to the whole picture show better motion rendition 
than does frame repeating but are still unsatisfactory for moderate 
and fast movement of subjects of normal contrast (the degradation 
becomes annoying at speeds of about 2 pels per frame interval). In 
the case of still pictures the quality is actually improved over that of 
normal (3- or 4-bit) EDQ because the temporal (frame-to-frame) 
averaging reduces the visibility of granular noise. 

When movement is detected, pels from alternate sampled fields are 
received, decoded, and displayed. In place of pels from the unsampled 
fields, an average is formed from the four nearest neighbors in the 
y — t diagram, as shown in Fig. 2b. For example, pels in lines (y, 1) 
are replaced with the average value of pels in lines (y — 1,0) (y + 1, 
0) (y — 1, 2) and (y + 1, 2). Thus both spatial and temporal inter- 
polation is used to form the new value in the unsampled field. 

Preliminary experiments with this mode applied continually to the 
whole picture showed adequate motion rendition for most head and 
shoulders views of a person talking; the loss in vertical resolution is 
barely noticeable, but in some regions (especially dark regions) of 
high vertical detail, some aliasing patterns can be seen. Under normal 
viewing conditions (see Section III), fast movement (4 pels per frame 
interval and above) of a contrasty edge appears slightly jerky. 


*Some other work on reducing the temporal resolution is given in Refs. 2 
and 3. 
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There are a number of ways in which the sampling mode can be con- 
trolled. The subjectively ideal, but probably most difficult, method is 
to change to the appropriate mode as each element is encountered. 
Another method is to code the whole of one line in the same mode, 
while a further method is to code the whole field in the same mode. The 
method adopted in most of our experiments is to use one mode 
throughout each field. The transitions from stationary mode to moving 
mode can be made at the beginning of any field but the reverse transi- 
tion is only permitted at the beginning of any even field, counting 


FIELD NUMBER, t 
o 1 2 3 4 5§ 6 7 





FIELD NUMBER,t 
o 1 2 3 4 5 


o 





DO o———__ + 0+—____- 9 —_——_ 60 >0 e 
| @——_» 0+ +0 et NN oN ON 
us if ANN ZN 
2 Os ie 0 a ao2 @. C ) e 
= 3 O<«————_ 9 —____»0<—___—_-@ 23 wa eae ™“e Fad 
4 Ot, Be *~ 
a4 o————> 0+ ——____ 9 —___ +0 Z4 < © 
z 5 eo—_—_> 0 +e —__>0 wes age “ Nye 
Zz 
a 6 o————_0 ——__ > 0 +9 = 6 e iar — sS 
oe ae ae 
4n (0,4) (1,5) 4 5 ) pispLayeD 
anti 2 3 (2,6)(3,7)) FIELDS Nt o (62) 2 (24) 4 (46) he 
ania} DISPLAYED 
ona (a) (b) FIELDS 
FIELD NUMBER, t 
oO 1 2 3 4 &§ 6 7 8 9 10 if 12 13 14 15 





LINE NUMBER,Y 
ann WN=— OO 
ce) 
| 
| 


STATIONARY 
(0,4) (15) 4 5 


MOVING 
o 1 6 (6,8) 8 


(-2,2)(-1,3) 2 3 (2,6) (3,6) 6 (6,8) 8 


(Cc) 


() DENOTES TEMPORAL AVERAGING fe) 
- DENOTES VERTICAL AVERAGING 
nN IS AN INTEGER 


+ @ 


8 (8,12)(813) 12 13 


(8,11) 10 


DNR OY 


‘0 ———_» 0 e —__—_» 0<— 


pao ge oe ee 


o-———___ > 9 +—_____0 =< —_—_8® 


-_ 6 +——_—-@ O° 


e—_——> 0 «6 
DISPLAYED 
STATIONARY FIELDS | 
| 
FOR LINES / 
an,anti |W” 
FOR LINES 
4n+2,4n+3 


14 (10,14)(11,15) 


DENOTES AN UNSAMPLED LINE 
DENOTES A SAMPLED LINE 


IS THE TEMPORAL COORDINATE 
OR FIELD NUMBER 


Fig. 2—Sampling patterns of lines and sequences of fields in the displayed 


picture for the different modes: (a) Stationary mode. 


(b) Moving mode. (c) A 


combination of modes (except near the mode changes, the fields represented in 
the output correspond with the temporal coordinate ¢). 
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from after the change from stationary to moving mode. This is done 
to prevent data being generated at a greater rate during two consecu- 
tive fields than can be handled by the channel. The mode changing 
information is negligible as only one extra bit of information is re- 
quired after every field. 

The delay network at the transmitter (Fig. 1) converts the effec- 
tively halved bit rate to a continuous bit stream of half the original 
rate; a similar network at the receiver reconverts the continuous bit 
stream back to alternate fully sampled fields or frames. The operation 
of these networks is described more fully in Section IV. 

Fig. 2c illustrates the line sampling pattern and the interpolation 
used when the system starts in the stationary mode (fields 0, 1, 2, 3, 
4, 5) and switches to the moving mode (fields 6, 7, 8, 9) and then 
switches back again. Also shown are the sequences of fields represented 
in the displayed picture. In both modes the lines displayed correspond 
either directly to the temporal coordinate or indirectly by temporally 
averaging from fields temporally equidistant from the current field. 
Near the changeovers some lines are misplaced temporally; for in- 
stance the unsampled lines of field 5 are replaced with lines from 
field 3 and 6 whose mean temporal position is 43. Different weights 
could be assigned to the signals of fields 3 and 6, but for simplicity we 
decided to subjectively test the coder using equal weight for all 
averaging. 

To experimentally test the coder we simulated Fig. 1 with the 
arrangement of Fig. 3. The picture format, scenes, and viewing con- 
ditions were the same as those used in the previous resolution exchange 
experiments, i.e., there were 271 lines per frame with a 2:1 line inter- 
lace and the frame frequency was 30 Hz; the sampling frequency was 
2 MHz. The scenes were head and shoulders views of a variety of 
subjects engaged in conversation varying from quiet to violent. The 
display raster was approximately 53 inches horizontally by 5 inches 
vertically and was viewed at 3 feet under ambient illumination typical 
of a well-lit office (about 70 footcandles). 

The primary coder is a 4-bit digital differential quantizer* whose 
quantizing levels are shown in Table I. The accumulated value, with 
a peak value of 127 levels, is fed to the frame memory via switch 8; 
(Fig. 38a). In the stationary mode §,; and Ss are switched at the line 
rate and S. is held at 0. The waveform L, used for switching at line 


* Although the quantizer has 17 levels and, strictly speaking, could not be 
transmitted as a 4-bit signal, certain combinations of levels were deleted so as to 
permit 4-bit transmission.+ 
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Fig. 3a—Experimental arrangement for testing simple interframe coder. Alter- 
nate fields or lines are fed to the frame memory via switch 8:. Switch Se selects 
either the previous field or lines from the previous frame for recirculation in the 
(delay line) frame memory. Adder A; performs the vertical averaging and adder 
Az performs the temporal averaging. Switch 8: selects the required output. 


rate, is shown in Fig. 3b as two waveforms of opposite phase. One 
phase applies to line numbers 1, 2, 5, 6, 9, 10, --: , or 4n and 4n + 1, 
where 7 is an integer, and the opposing phase applies to line numbers 
3,4, 7,8, °°: , or 4n + 2 and 4n + 3. Thus, during one field the value 
of L changes each successive line, and for a given line the waveform, 
L, changes polarity for each frame. Figure 3b also shows which fields 
are present at points A, B, C (Fig. 3a) for each of the two sets of 
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Fig. 3>—Waveforms a V, and S and fields present at seats A, B, C (Fig. 3a) 
and at the output for both sets of lines. In real time, waveform L switches at 
the line rate but is shown here as two waveforms, one for each set of lines 
switching at the frame rate. 


lines, for the same sequence of fields.and modes as described in the 
previous section and shown in Fig. 2. In the stationary mode (S = 0) 
the required output can be obtained either by continuously averaging 
the signals present at A and C or by switching S; at the line rate; the 
latter method is used because it allows greater simplicity when 
changing modes. 

In the moving mode 8, and S3 are switched at the field rate (wave- 
form V) and Fig. 8b again shows which fields are present at points 


TABLE I—QUANTIzZING LEVELS 


Level Decision Level Representative Level 

0 1/2 0 
+2 3 2 
+3 5 4 
+4 7-1/2 6 
+5 9 

11-1/2 

+6 16-1/2 14 
+8 25 
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A, B, C of Fig 3a; the output is formed by switching between C and 
(A + C)/2 at the field rate. To bring about the same sequence of 
outputs near the mode changes, as shown in Fig. 2; the waveform- 
controlling switches 8, and S: are changed as soon as S changes, but the 
change of waveform- controlling switch Ss is delayed by 1 frame: The 
sequences of fields present in the output in the experimental system 
(Fig. 3a) are also shown in Fig. 3b and correspond to the sequences 
shown i in Fig. 2c. 

In the experimental System, movement is aiicemel to be present if, 
during any field, 512 or more pels exhibit a frame difference simplitude 
greater than 15 levels (out of 255). To return to the stationary mode, 
less than 512 picture elements must have a frame difference amplitude 
greater than 15 levels during the even field where the field number is 
counted from the field in which the transition is made to the moving 
mode. 


Ill. RESULT 


The picture quality was consistent wih the results of the prelimi- 
nary experiments conducted on the separate modes as described 
above. The two faults of the moving mode, the aliasing patterns in 
areas of strong vertical detail and the slight jerkiness of fast-moving 
contrasty subjects, were still visible. Degradation due to the switching 
of modes was seldom visible and the effect of switching the whole 
picture instead of just the moving areas was not troublesome even 
when the plane of sharpest focus was. midway between the eee 
head and the curtains in the background. 

In some related experiments the switching of modes was confined to 
the moving area. The moving area detector examined a sequence of 
eight frame differences to decide whether the current picture element 
belonged in a moving area:! The resulting pictures of similar scenes 
viewed under the same conditions were no more pleasing and the 
movement detector setting was more critical; i.e., with a poor setting 
slowly moving sharp edges tended to break up due to intermittent 
mode switching. 


IV. DISCUSSION 


4.1 Delay Requirements 


- At the transmitter every second pel is. delayed by one line period, or 
one field period (according to the mode), to bring about a constant 
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bit rate. This delay can also be used by the movement detector for 
generating the required frame differences. Of course, the movement 
detector must now work on one-quarter as many points as before. 
But because of the global nature of the decision and the large num- 
ber of supra-threshold points required to change to the moving mode 
(512), reliable detection of movement could probably be performed 
with even less than this number of points. Thus, if the primary 
(EDQ) coder has:an output bit rate of 6 Mb/s (or 200,000 bits per 
frame) a delay store of 50,000 bits (half a field) is required at the 
transmitter. . 

At the receiver a similar delay is needed to convert the incoming 
constant bit rate to the required 6 Mb/s rate during aiternate lines 
or fields and a frame memory of 200,000 bits is needed to store the 
pels required for display. 

It is tempting to devise schemes in which the alternate fields or 
lines of data from different sources are multiplexed so that the 50 k-bit 
delay stores at the transmitter and receiver are unnecessary. However, 
such schemes have so far been less practicable than using the extra 
storage; the main difficulty lies in synchronizing any two cameras in 
an unsynchronized system. 


4.2 Recoding of Output 

When the primary decoder is located remotely from the interframe 
decoder, the unsampled fields or frames (depending on whether there 
is movement or not) must be recoded since, when different quantized 
differences (representative values) are averaged, the resulting differ- 
ence will not necessarily. belong to the set of representative values 
allowed by the in-frame decoder. A circuit to achieve this is shown in 
Fig. 4.* The averaged difference signal is added to the error term from 
the previously quantized level: the output of the quantizer is sub- 
tracted from the input to form the new error term. The advantage, 
previously mentioned, of reducing granular noise in the stationary 
mode is lost in the process of recoding. The effect of recoding was 
tested experimentally by differentially quantizing the output of switch 
S3 (Fig. 3a) before decoding and displaying the signal. There was no . 
appreciable increase in noise (over the primary coded signal) due to 
recoding either in the stationary mode or the moving mode. 

In a switching system it may be desirable to convert and: reconvert 
the signal between the full rate and the half rate many times. This 


* This is Cutler’s error feedback coder. 
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Fig. 4—Recoder to convert averaged weights to standard weights. 


can be achieved with no loss of quality other than that introduced 
in the initial conversions by having the full-rate signal contain one 
code word in each field to indicate the sampling mode; thus no further 
movement detection need be employed and subsequent interframe 
encodings are simpler than the initial one. Decoding will remain’ 
unchanged. 


4.3 Comparison 

Compared with other interframe coders®’ the simple interframe 
coder has a higher bit rate and, under certain conditions, a poorer 
picture quality. However, there are certain offsetting advantages 
which are described below. 

One advantage is the relatively low memory requirement (300,000 
bits for the transmitter and receiver combined). Other existing inter- 
frame coders require 530,000 bits of storage for the frame memory 
at the transmitter and the same again at the receiver, and sophisti- 
cated buffers are also required because of the randomness in the 
generated bit rate of these coders. It is possible to use smaller stores 
in such coders by requantizing the input signal. This operation pro- 
duces a small loss in picture quality and also it is not yet clear what 
effectit will have on recoding the signal.® 

The simple interframe coder can be used with any primary coder 
which uses only previous pels along the line for prediction (see, for . 
example, Refs. 4, 9, 10). The primary encoding stage may well be 
located at the first level of switching while the frame-to-frame coding 
section may be located at a higher level in the switching hierarchy. 
With such an arrangement, the secondary encoding stage can be 
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bypassed altogether when less than half the Picturephone trunk con- 
nections in a group of trunks is not in use. 

The effect of channel errors on the received picture is probably 
about the same as in a standard DPCM system. If element-to-element 
differential encoding were used in the primary encoder, errors would 
be confined to a single line (in the received signal) as the accumulators 
at the transmitter and receiver are reset at the end of each line.* If 
the movement detector is in the stationary mode, the effect of an error 
in one frame will also be displayed at half-amplitude in neighboring 
frames; but on the other hand, no errors will originate in the neigh- 
boring frames. Similarly, if an error occurs in the moving mode, the 
effect will appear at full-amplitude in the sampled field and at quarter- 
amplitude in the two adjacent lines in each neighboring field, but the 
neighboring fields will contain no indigenous errors. Most error de- 
tection techniques that can be applied to the primary encoding section 
can be used without modification when the frame-to-frame coding 
section is added (e.g., the check-summing method of error detection 
suggested for the differential quantizer) .* 

Apart from one mode-bit transmitted every second field the coder 
has no change-of-mode words, addresses, start-of-line words, or start- 
of-frame words. In a coder using these special words, an error in the 
received data, if it occurs in a special word, can be especially trouble- 
some. 


4.4 Encoding Using the Previous Line 

There may be occasions when the primary coder uses pels from the 
previous line (in the same field) as well as from the present line for 
predicting the value of the current pel (see for example, Ref. 11). The 
stationary mode described above is now unsatisfactory because only 
every other line in each field is available at the receiver. We therefore 
modified the apparatus of Fig. 3 to evaluate a coder in which alter- 
nate frames are transmitted in the stationary mode and temporal 
interpolation is used to replace the unsampled frames. The moving 
mode is unchanged and whole fields are now left intact in both 
modes. The change from a stationary to a moving mode can now be 
made only at the end of a frame rather than at the end of a field, as 
before. In addition, the delay stores for converting to and from a con- 
stant 3Mb/s rate are twice as large as required before, but the total 
required memory is still less than half that of other existing inter- 
frame coders. Experimental results with this coder gave pictures 
which were as satisfactory as those already described. 
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Transmitting Television as Clusters 
of Frame-to-Frame Differences 


By J. C. CANDY, MRS. M. A. FRANKE, B. G. HASKELL, and 
F. W. MOUNTS 


(Manuscript received February 16, 1971) 


A number of redundancy reduction techniques are used in a coder 
that is about eight tumes more efficient than simple PCM. The coder 
ts capable of transmitting Picturephone® signals at an average rate of 
one bit per picture-element (2 Megabits per second). When there is 
movement wn the scene, most transmission time is devoted to the parts 
of the picture that change significantly. The data are generated wreg- 
ularly but the data flow is smoothed prior to transmission in a buffer 
that holds about one frame of data. The redundancy reduction tech- 
niques used and the behavior of the coder are discussed both from an 
intuitive and from a statistical viewpoint. 

The positions of elements that change are signaled by addressing 
the first element of a run of changes and marking the end of the run 
with a special code word. The changes of luminance are transmitted — 
as frame-to-frame differences using variable-length code words. When 
rapid, motion makes the buffer more than a quarter full, only dif- 
ferences for every second element are transmitted, the values of the 
intervening changed elements being set equal to the average of their 
neighbors. If the buffer continues to fill, the threshold that determines 
which changes are significant is raised from 4/256 to 7/256 of the 
maximum signal value. When violent motion causes the buffer to fill 
completely, replenishment is stopped for about one frame while the 
buffer empties. 

Subsampling and raising the threshold are not objectionable because 
viewers rarely detect the small impairments introduced in moving 
images. Observers are critical, however, of small impairments in 
stationary scenes. Thus, to maintain high quality in stationary areas, 
the entire picture is forcibly updated every three seconds by trans- 
mitting 8-bit luminance values for three lines of every frame. 
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A record of the coder’s behavior is available as a 16-millimeter 
movie film. 


I. INTRODUCTION 


The introduction of Picturephone service has stimulated a great 
deal of interest in techniques for transmitting video signals more 
effectively than the way normally used for broadcast television. Since 
digital transmission over large networks has been found to be more 
economical for video signals than analog transmission, digital coding 
is presently receiving most of the attention. One of the best known 
improvements on simple PCM is differential coding, which takes ad- 
vantage of the similarity between consecutive samples along a scanning 
line. Differential coding is simple and yields a bit-rate savings of 
around 50 percent depending on the picture quality desired. 

This paper describes a method that makes use of the similarity of 
pictures in successive frames as well as the similarity of adjacent 
samples. Until recently, using the correlation between frames to im- 
prove efficiency required complex and costly equipment; but now, 
with low-cost memories and integrated circuits, it is economically 
attractive and seems to be particularly suitable for the visual tele- 
phone application. 

All techniques that reduce the required channel capacity by antici- 
pating a redundancy in the signal have a limitation, in that signals 
which do not contain the expected redundancy cannot be transmitted 
unless some alternative process is provided for them. This limitation 
is possibly the reason why redundancy-reducing techniques have not 
been used extensively for broadcast television, where there is need to 
display unusual scenes to attract viewers and entertain them. With 
visual telephone, however, there is a great need for economical trans- 
mission because each signal will not have an audience of millions to 
share its costs. Indeed it is unlikely that users will want to pay for 
transmission capabilities that are rarely needed. 

Recent work? has demonstrated how differential coding enables 
Picturephone signals to be transmitted using three or four bits per 
picture-element. This is about three bits less than simple PCM requires 
to give comparable quality. Inherent in this technique is a restriction | 
on the amount of detail in the scene that can be reproduced faithfully. 
A much greater saving has been obtained by F. W. Mounts,*® using 
coders that signal only the changes between successive frames. These 
coders require only one bit of channel capacity per picture-element, 
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the restriction in this case being on the amount of movement that can 
be accepted in the scene. 

It is anticipated that users of Picturephone service will want to 
display scenes containing fine detail much more often than scenes 
containing rapid motion; and so they would tolerate blurring of moving 
objects more readily than a similar loss of spatial detail in stationary 
scenes.’ Consequently, ways are used for exploiting the correlation 
between successive frames as well as that between successive elements 
in order to effectively encode Picturephone video signals. 

The work to be described here, which is a continuation of F. W. 
Mounts’ original work, improves the coder so that more motion can 
be tolerated while using a much smaller buffer than was originally 
required. (A coder that uses no buffer but requires a data rate of 1.5 
bits per element is described in Ref. 8). The techniques used Here rely 
primarily on two phenomena: 


(t) Large areas of the average Picturephone scene change very 
little or not at all between successive frames. Thus, in each 
frame only a small amount of new information is required to 
specify these areas to the receiver. 

(77) More information is needed for specifying areas of the picture 
that change. But they need not be reproduced at the receiver 
with as much spatial resolution as stationary areas because 
the storage properties of the camera tube tend to blur move- 
ments, and viewers cannot accurately resolve fine details that 
are in motion. 


The coder transmits fresh information describing the stationary 
parts of the picture at least once in three seconds. At the receiver, 
this information is retained in a frame memory from which the display 
is derived: Portions of the picture that change significantly are up- 
dated as soon as they are detected, but sometimes with slightly lower 
resolution than in the stationary areas. 

There follows a description of the proposed coding strategy and an 
account of a simulation of the coder that has been demonstrated. * 


Il. CONDITIONAL REPLENISHMENT 


In Ref. 6 F. W. Mounts called his method “Conditional Replenish- 
ment.” He transmitted only the information necessary to describe the 


* A demonstration of the system processing a live scene was shown during the 
Keynote Session of the 1970 IEEE International Convention. 
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intensity of elements that changed between frames. The implementa- 
tion used two frame memories, one at the receiver and another at the 
transmitter, to store the required reference picture. The incoming 
signal was compared element by element with the reference picture. 
When the magnitude of the frame-to-frame difference was greater than 
a certain threshold, it was regarded as significant, and the new signal 
value replaced the reference value in the memory. The new value was 
also transmitted to the receiver, where it updated the contents of the 
receiver frame memory. In this way the receiver memory was made 
to track the transmitter memory, thereby providing a signal for dis- 
play. 

Every time an element value changed, two numbers were trans- 
mitted, one describing the new amplitude and the other describing the 
location of the element. In addition, synchronizing words were sent at 
the start of every frame and every line, making it necessary to address 
only the horizontal position of changed elements. 

For most television pictures, the technique of sending only data 
relating to elements that change gives a considerable saving of trans- 
mission cost. There is a saving when a large fraction of the scene is 
stationary or when moving objects contain large areas that are uni- 
formly bright; in either case little transmission is required. Part of 
the saving is offset by the need to define the position of changed 
elements in addition to their new values. Nevertheless, F. W. Mounts 
found that, by using a large buffer to smooth the highly irregular data 
flow, visual telephone signals could be transmitted using, on the aver- 
age, only one bit per picture-element. 


III. THE PICTURE FORMAT AND TYPICAL STATISTICS 


The video signal, which is very similar to the Picturephone video 
signal, is derived from a picture scanned with 271 interlaced lines at 
30 frames per second. An example is shown in Fig. 1. The bandwidth 
is nominally 1 MHz, and elements are sampled at about 2 MHz to 
give 248 samples in a line period. The visible portion is about 13 cm 
high and about 14 cm wide; it contains 255 lines each with 207 ele- 
ments. For practical convenience the signal is coded as 8-bit PCM so 
that all processing can be performed digitally in real time. The posi- 
tions of elements in the frame are described by signaling the start 
of every line and addressing their positions only within the line. If an 
8-bit address word is used, 206 of its 256 values are needed to address 
the visible elements on a scanning line. In the simplest case, only three 


TV REDUNDANCY TECHNIQUES 1893 








Fig. 1—A typical picture. 


others are needed: one to mark the start of an even field, one to mark 
an odd field, and one to mark the start of a new line. In an operational 
system, however, all of the remaining 50 values would probably be 
used to help protect the synchronization from error and to signal 
special modes. 

The frame contains a total of 67,208 elements of which 52,785 are 
visible. Usually only a small fraction of these elements change be- 
tween any two frames.?°1! The ordinate in Fig. 2 shows the number of 
elements that change in a frame-time by more than 3/256 of the 
maximum signal amplitude. This graph shows 18 seconds of signal 
measured when there was an unusually large variety of motion in the 
scene. For convenience the scale is classified into five ranges cor- 
responding to the viewer’s subjective judgment of the activity. Figure 
3 is a probability density function of tne number of changes in a 
frame. It shows how often the various kinds of activity can be ex- 
pected in typical Picturephone scenes. The data were collected from 
75 minutes of picture material which included head-and-shoulder views 
of one and occasionally two persons, camera panning, subjects walk- 
ing through the field of view, and pictures of printed text. On the 
average, less than 4 percent of the elements change in a frame-time. 
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Fig. 2—The number of elements in a frame that change by more than 3/256 
of maximum amplitude, plotted over a period of 18 seconds when there was a 
at of motion. Each frame contains 67,000 elements of which 53,000 are 
visible. 


IV. THE BUFFER REQUIREMENT 


When only 4 percent of the elements change, it is fairly easy to 
design a coder whose data-rate is less than one bit per element; there 
are times, however, when the rate is much higher. Thus, a buffer must 
be used at the transmitter to smooth the highly irregular flow of data 
from the coder and feed it to the channel at a constant rate. Another 
buffer at the receiver performs the inverse operation. 

It is evident in Fig. 2 that peaks of high activity last for an appre- 
ciable part of a second; thus, a buffer to smooth them out would intro- 
duce an intolerable delay into the signal path. To prevent delay and 
echoes in the accompanying voice channel from being a distraction to 
users of Picturephone service, the total delay between talker and 
viewer should be less than a third of a second;?? i.e., ten frames. This 
sets an upper limit on the size of the buffer. 

Not much is gained, however, by using a buffer this large. The 
buffer should be at least large enough to smooth the data over one 
field-time (two interlaced fields per frame are used throughout), since 
the changing elements are usually very unevenly distributed within a 
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picture. This is illustrated in Fig. 4 by bright dots placed in the 
position of elements that have changed in the picture. There seems 
to be little advantage, however, in carrying data over from one field 
to another; unless it can be accumulated for many frames, the activity 
is usually very similar in successive fields. To illustrate this similarity, 
Fig. 5 gives the probability that the variation in the number of sig- 
nificant changes in successive fields exceeds a stated amount: the 
number of changes in a field differs from the number in the previous 
field by more than 500 in only 5 percent of the fields measured. 
Another study™ has also confirmed that the buffer should indeed be 
capable of storing data over a field-time and not very much larger. 

It is proposed that the buffer be capable of storing the amount of 
data that is transmitted in two field-times. This is more than enough 
to smooth the irregularities caused by spatial distribution of activity 
and also allow latitude for controlling the coder. Use of this buffer, 
which is about one tenth the size of the one used by F. W. Mounts,® 
requires that the data fed into it in a frame-time be approximately 
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Fig. 3—The probability density function of the number of changes in a frame 
for 75 minutes of signal. The degrees of motion in the scene are indicated. 
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Fig. 4—A display of dots marking elements that have changed in a frame 
during very active motion: about 12,000 changes occurred in this frame. The 
picture is that shown in Fig. 1. 


equal to what can be transmitted by the channel in a frame-time. To 
satisfy this constraint on the coder, the average number of bits used 
to signal a change in an element value must decrease as the activity 
in the scene increases. For example, Table I gives an estimate of the 
number of bits available to code each change for various amounts of 
activity if the transmission rate is 67,000 bits per frame (one bit per 
element for Picturephone use) : 

From Fig. 3 and Table I, it is seen that conditional replenishment 
accommodates pictures of normal activity, i.e., 70 percent of the 
scenes, when transmitting two 8-bit words (address and amplitude) 
for every change. It will be shown how to extend the range by en- 
coding, with fewer bits. Elements will be addressed in clusters instead 
of individually, and their changed amplitudes will be transmitted as 
frame-to-frame differences using four bits on the average instead of 
the eight bits needed to define a completely new amplitude. 


Vv. ADDRESSING ELEMENTS IN CLUSTERS 


In the original conditional replenishment system, about half of the 
transmitted data were used for addressing the positions of changed 
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Fig. 5—The probability that the difference between the number of changes in 
one field and. those in the next exceeds X. 


elements. It is observed in Fig. 4 that significant changes usually occur 
in clusters crowded near brightness edges of moving objects. Thus, it 
is profitable to describe the position of changed elements in groups 
rather than individually. To explore such methods, the magnitudes of 
frame-to-frame differences for one whole frame were stored in a 
memory during a period of active movement. Figure 6 shows an 
example of how changes are distributed among runs of various lengths. 
In this example 10,547 elements changed requiring 84,376 bits to 
address them individually each with eight bits. Using cluster coding, 
the start of each run is addressed with an 8-bit word, and the end of 


TABLE I—NuMBER oF Bits PER CHANGE AVAILABLE AT VARIOUS 
DEGREES OF ACTIVITY 











No. of Changes in a Average Number of 
Description of the Frame of 67,000 Bits Available per 
Activity Elements Change 

Inactive < 2,000 >30 
Normal ~ 4,000 ~16 
Active ~ 8,000 =~ 8 
Very active 13 ,000 ~ 5 
Violent >16,000 <4 
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Fig. 6—A histogram of the number of contiguous runs of various lengths in a 
frame. The motion was very active. 


the run is signaled with a special 4-bit word. With this method the 
1683 runs of changed elements in Fig. 6 could be positioned using only 
20,196 bits; a saving of 64,180 bits, (about 75 percent). It will be 
shown that an additional saving is possible. 

Addressing runs is inefficient when applied to isolated changes in 
the picture, because 12 bits are used to fix their positions. However, 
it was observed that most isolated changes can be ignored without 
spoiling the picture quality. The following criterion provided ac- 
ceptable quality: any change in an element, no matter how large, was 
regarded as insignificant if 1t were immediately preceded and followed 
by two elements that changed by less than the threshold of sig- 
nificance. For the data in Fig. 6, 187 changes could be ignored under 
this criterion. 

Another improvement is obtained by coalescing runs that are 
separated by a small number of unchanged elements. For example, 
when using the cluster-coding just described and 4-bit words to signal 
changes of amplitude, it is preferable to continue a run that is inter- 
rupted by less than four unchanged elements, since it requires 12 bits 
to end a run and restart a new one, but only 4, 8, or 12 bits to continue 
coding the insignificant changes between runs. The technique of 
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coalescing runs and ignoring isolated changes will be referred to as 
“bridging.” Bridging often improves quality as well as efficiency 
because some elements, changing less than the threshold, are updated 
and isolated noise impulses are suppressed. It should be noted that 
the suppression of isolated changes takes place before bridging. Per- 
forming the operations in the reverse order would be somewhat less 
efficient. 

Figure 7 shows the bright dots covering elements that would be 
coded using the techniques described above and Fig. 8 shows dots 
at the start of every bridged-run (cluster). Figure 9 shows the dis- 
tribution of the cluster lengths for the data in Fig. 6. (Notice how 
bridging significantly lengthens the runs, reducing the number of runs 
from 1,683 to 736 while increasing the number of elements in the 
runs only from 10,410 to 11,886.) To transmit these data, about 47,544 
bits would signal 4-bit amplitude changes and 8,832 bits would 
signal addresses. Figure 10 is plotted the same as Figs. 6 and 9 but 
with the threshold raised to 7 parts out of 256. Here the changed ele- 
ments are grouped more closely together, and there are fewer of them. 









































Fig. 7A display of cismvents that would be transmitted using cluster coding 
for the changes in Fig. 4. Isolated changes are ignored and gaps of three or less 
are bridged. 


1900 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 





Fig. 8—The dots show the start of every cluster. 


This frame requires 32,516 bits for amplitudes and 9,132 bits for 
addresses. 

In general, cluster coding reduces the number of address-bits sig- 
nificantly. Application of cluster coding to the data used in Fig. 3 is 
illustrated in Fig. 11. This graph shows that the number of clusters 
increases much slower than does the number of changes in the picture. 
Indeed when there are more than 10,000 changes, the number of 
clusters is relatively constant; therefore, cluster coding is very effec- 
tive for smoothing the generation of address bits. The next two 
sections describe methods used to reduce and regulate the generation 
‘of bits needed to signal amplitude-changes. 


VI. FRAME-TO-FRAME DIFFERENCES 


6.1 Amplitude Distribution of Difference Signals 


While discussing cluster coding, we allowed 4-bit words for describ- 
ing frame-to-frame differences of the video signal. This difference 
signal is generated at the transmitter when the reference is subtracted 
from the input. Its amplitude can range between +255 units but 
usually it is small, as Fig. 12 demonstrates. Figure 12 shows how the 
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magnitudes of the significant frame differences are distributed statis- 
tically for typical Picturephone scenes. The motion in the scene was 
normal for curve (a) and very active for (b). The circuit used for 
generating the frame differences is shown in Fig. 13. The quantizer 
was bypassed while the data was being taken. 

The frame-to-frame difference signal may be quantized and trans- 
mitted with much less than the eight bits needed to send the absolute 
amplitudes. For example, when the quantizer is used in the circuit 
of Fig. 13 (output levels correspond to the eight divisions of the 
abscissa of Fig. 12), the displayed picture has good quality for 
still and slowly moving scenes. However, there is noticeable distortion 
of rapidly moving edges: a sharp, moving edge appears as a cascade 
of smaller steps as each increase of luminance corresponds to the 
magnitude of the outer quantization level (438 units). This distortion 
is eliminated by providing more quantization levels, for example 64 
levels can give excellent reproduction for all types of activity in the 
scene. 

Another important reason for using more than 16 levels is to reduce 
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Fig. 9—A histogram of the number of clusters of various lengths for the runs 
Seer Fig. 6. Isolated changes are ignored and gaps of three or less are 
ridge 
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Fig. 10—The effect of raising the threshold from 4/256 to 7/256 on the runs 
and bridged-runs given in Figs. 7 and 9. Changes that equal or exceed the thresh- 
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Fig. 12—The probability distribution for the amplitude of frame-to-frame 
difference signals during normal and active motion. Its maximum is 255 units. 


the residual discrepancy between the reference and the input. The 
stored reference is updated by addition of the quantized difference 
signal; therefore, its updated value will differ from the input by an 
amount dependent upon the quantizing scale even after movement has 
ceased. If, in the next frame time the discrepancy exceeds the threshold, 
transmission capacity will be used to correct it even though the input 
signal remains unchanged. This need for additional transmission, after 
large changes have ceased, makes the system inefficient for certain 
types of input. Therefore, the coder uses a quantizer that can represent 
any change to within +4 units of its true value. The quantizing scale 
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Fig. 13—The circuit used for measuring frame-to-frame difference. 
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has 64 levels distributed thusly: 


+1, +5, +10, +15, +20, +27, +35,--- 


increasing in increments of +8 up to +235. 

A signal quantized to 64 levels can be signaled with 6-bit words, but 
since the outer levels are rarely used, we use only a 4-bit word to 
signal the innermost 14 levels. The 4-bit word has sixteen distinct 
values. One of the remaining two values is used to specify the end of 
a cluster. Whenever the magnitude of the next frame difference exceeds 
39 units the remaining 4-bit word is used to tell the receiver that the 
next 12 bits should be decoded as two 6-bit words rather than three 
4-bit words. These 6-bit words specify the frame differences using the 
full set of 64 levels. Six-bit words continue to be transmitted in pairs 
until the last word represents a level lying in the innermost 14 levels; 
then we revert to 4-bit words. 

Figure 14 shows a picture with bright spots marking the elements 
that are updated with 6-bit words; they occur in groups near large 
rapidly moving edges in the picture. On the average, 6-bit words are 
used for less than 10 percent of the updated elements. We find that 





Fig. 14—The changes that are signaled with 6-bit words for the data in Fig. 4. 
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this method of transmitting quantized frame differences requires only 
about half the transmission capacity needed for sending the true 
picture-element brightnesses as 8-bit PCM. However, when differ- 
ences are transmitted, a mechanism is needed for preventing trans- 
mission faults from introducing permanent errors into the signal. This 
mechanism is described in section 8.2, ‘Forced Updating.’ 


6.2 Subsampling 

The techniques described in the previous sections allow pictures to 
be transmitted using, on the average, less than six bits per changed 
element. Thus, pictures with active motion can be accommodated. 
However, to be useful, the range of the system must be extended even 
further. This extension is obtained by making use of the fact that in 
moving parts of the picture the sampling frequency can be halved, 
yet not cause noticeable degradation.” 

A measure of the activity in the scene is obtained by monitoring 
the number of bits held in the buffer. When it exceeds a prescribed 
amount, indicating active motion, we switch to a subsampling mode 
of operation. In this mode only the changes in every other element in 
a cluster are transmitted and used to update the stored reference pic- 
tures. The amplitudes of the intervening elements in the cluster are 
set equal to the average of the two neighboring values. Table II shows 
the states of a sequence of sixteen elements having, initially, reference 
values Ay, Az, ***, Aig. The new input values are assumed to be 
those listed on line (b) where, in every case, the change from value A 
to value B exceeds the threshold. Line (c) shows which difference sig- 
nals are transmitted in the normal mode of operation. The change A; 
to Bs, an isolated change, is not transmitted, but the stationary ele- 
ment value Ajo is transmitted in order to bridge the cluster. Line (d) 
shows the new reference, the apostrophe signifying that a quantiza- 
tion has occurred in the value. If the coder were operating in the sub- 
sampling mode, the transmitted differences would be those marked 
on line (e), and the new reference would be those values on line (f). 
Here, the symbol C’ represents the average of values on either side of 
the element; for example, 

_ Bet Bio 
i: 2 

When subsampling, the set of elements whose values are considered 
for transmission are arranged in a fixed checkerboard pattern in the 
raster. This pattern is unchanged from frame to frame in order that 
discrepancies between the reference and the input caused by the inter- 


Co 


TABLE II—SELEcCTION oF ELEMENTS FOR TRANSMISSION IN BoTH THE NORMAL AND THE SUBSAMPLE MODE 


A, represents the original reference amplitude of the nth element 
B, represents a new value of an amplitude 

B,’ represents a quantized amplitude 

C,, represents an interpolated amplitude 
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polation shall not instigate a transmission; moreover, a changing pat- 
tern is subjectively unpleasant. 

Using the subsampling technique, very active motion can be ac- 
commodated in the scene, because, on the average, less than four bits 
are needed to signal each significant change. Although the spatial 
resolution is effectively reduced by a factor of two in regions of 
motion, it is almost impossible to see. Indeed, additional loss of resolu- 
tion can be tolerated to further extend the range of the coder. This 
extension is obtained by raising the threshold® that determines which 
changes are significant. 


6.3 The Threshold . 

Normally the threshold is set to accept changes greater than or 
equal to 4 parts out of 256; this gives excellent reproduction of motion, 
yet prevents small amounts of noise from needlessly using transmis- 
sion capacity. When there is active motion in the picture and the buffer 
starts filling in spite of subsampling, the threshold is raised one unit 
at a time as the buffer fills up, finally reaching a maximum threshold 
of seven units. 

When the threshold is at a high value the unreplenished changes 
appear as a stationary noise, which has the appearance of a dirty 
windowpane placed before the scene. With threshold values in excess 
of eight this noise is clearly visible in dark, moving areas of the 
picture. When the threshold is less than eight the picture impairment 
is small, and with thresholds less than five the noise is unnoticeable. 
Figures 15 and 16 show how raising the threshold reduces the number 
of changes that are counted as significant. Besides reducing the trans- 
mission requirement, raising the threshold also makes the coder more 
tolerant of noise in the input video. In fact, the coder is capable of 
processing signals that have been contaminated with noise whose 
root-mean-square value is 35 dB less than the peak signal value. Such 
high noise levels would be unacceptable for commercial service; we 
expect noise to be 45 dB less than the signal. 


VII. BUFFER OVERLOAD 


The techniques we have described so far allow the majority of 
Picturephone scenes to be encoded and signaled at an average rate of 
one bit per picture-element. Only occasionally does the motion be- 
come so violent that the system is congested. Such situations occur 
when the camera is panned over a very detailed scene or when the 
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4/256 8/256 


Fig. 15—A display of the elements whose changes equal or exceed thresholds 
of 1/256, 2/256, 4/256, 8/256. 


subject suddenly stands up close to the camera; these instances are 
usually of short duration. To prevent the buffer from overflowing at 
these times, we simply inhibit all replenishment of memories at the 
end of the cluster during which the buffer becomes 99 percent full. 
Replenishment is inhibited until the buffer content falls to about 
2,500 bits, which takes about one frame-time. Afterward, coding 
resumes with the constraint that it remain in the subsampling mode 
with a threshold of seven for at least one entire field. 

While replenishment is inhibited, the data previously stored in the 
memory are displayed again, and the motion in the picture becomes 
somewhat jerky.1* Viewers have not found the operation to be very 
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objectionable, and some people have grown accustomed to seeing 
jerkiness in vigorous movement. However, this is one aspect of the 
coder that can probably be improved by using the vertical correlations 
in the picture® in much the same way as subsampling uses the horizonal 
correlations. 


VIII. FIXED BACKGROUND TRANSMISSION 


8.1 Start-of-Line Codes _ 

In previous sections we have concerned ourselves with describing 
techniques for signaling changes which occur irregularly in the picture. 
Signaling addresses and amplitude changes use about 90 percent of 
the transmission capacity; the remaining 10 percent represents the 
synchronization and forced updating. These data are transmitted in- 
dependently of picture content unless the buffer fills; then the forced 
updating is interrupted for a frame-time but the synchronization is 
always transmitted jin order to keep the transmitter and receiver in 
step. . 

We have already mentioned that the start of each new line of the 
raster is signaled with a selected value of the 8-bit address words. (A 
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Fig. 16—A histogram of the number of changes in a frame that exceeds various 
thresholds for typical scenes. 
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suitable choice of the code will not be discussed in this paper, but 
notice that 50 code words are available after 206 have been reserved 
for addressing the positions of elements on the linc.) Changes from 
the normal to the subsampling mode will be signaled by a change in 
the start-of-line code. A special code is also used at the start of the 
first line in each field and at the start of lines that are to be forcibly 
updated. | 

- Using 8-bit words for marking starts of lines requires about 2,000 
bits in each frame-time, which is about 3 percent of the intended 
transmission capacity. 


8.2 Forced Updating 

Forced updating is a process for periodically rewriting the entire 
picture with full amplitude values. It serves two purposes: To insure 
a very high quality reproduction of stationary parts of a scene and to 
correct errors introduced during transmission by periodically aligning 
the contents of the transmitting and receiving memories. 

This forced updating is accomplished by transmitting in each frame- 
time, three lines of the picture as 8-bit PCM. The three lines are 
evenly spaced in the raster and are moved up each frame-time so that 
the entire picture is updated in about three seconds: Thus, less than 
three seconds after a portion of the picture became stationary it ac- 
quires the quality obtainable with 8-bit-PCM transmission. Nearly 
perfect reproduction can be obtained for graphics and similar scenes 
that display spatial detail rather than movement. 

Forced updating of three lines in each frame uses about 5,000 bits 
or about 8 percent of the transmission capacity. Thus, about 11 percent 
of the transmitted information is fixed. 


IX, PREVENTING THE BUFFER FROM EMPTYING 


We have described ways for preventing buffer overflow by reducing 
the rate at which data are generated, always taking care to maintain 
a satisfactory quality in the transmitted picture. The problem of pre- 
venting the buffer from becoming empty is much easier to solve be- 
cause there are many ways of acquiring data in order to improve the 
quality of the picture. We have chosen to use the forced updating 
mechanism for this purpose. The next line of input data is “forcibly 
updated” whenever the buffer content falls below 2,500 bits. When 
there is no motion, the entire picture is updated, within a quarter of a 
second, by this technique. This helps to obtain a high-quality repro- 
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duction of stationary scenes, such as graphic material, and always 
provides data for the buffer even during the vertical fly-back interval. 


X. THE SIMULATION TRANSMISSION EXPERIMENT 


Having described the coding strategy, we will now describe a lab- 
oratory simulation of the system. 

The simulator used produces a received picture which has all the 
characteristics of one processed by a real system except that effects of 
transmission delay and digital transmission error are not included. 

The simulator is a more complex version of the test circuit shown 
in Fig. 13; its main features are given in Fig. 17. The video signal 
enters at the upper left where it is sampled and coded as 8-bit PCM. 
This digital coding is required because it is impractical to build analog 
memories and analog processing circuits that are sufficiently accurate 
to perform the operations previously described. The coded input signal 
is compared with the reference signal emerging from the frame 
memory. The difference is then fed to a quantizer and to a threshold 
circuit that determines whether or not the change is significant. If 
the difference is insignificant, the threshold circuit tells the control 
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Fig. 17—The laboratory system used for simulating the behavior of the coder. 
It operates directly on real video signals. 
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to open switch B to prevent the quantized signal from being fed to 
the buffer or to the memory. When the first significant difference is 
encountered, starting a cluster, the control closes switch B so that 
the quantized difference can be fed toward the buffer and to the adder. 
The adder combines the difference with the reference that is being 
circulated from the output of the frame memory. Thus, for significant 
changes the reference picture is brought up-to-date by adding the 
quantized difference to it. For insignificant changes, it remains un- 
changed in the memory since zero is added to it. 

The purpose of the delay included before switch B is to give the 
control circuit enough time to decide whether to start or end a cluster. 
Recall that changes not accompanied by other changes are ignored, and 
clusters are ended only when the next four picture-elements change 
less than the prescribed threshold. The delay in the memory feedback 
path insures that the reference is added to the corresponding quantized 
difference. 

In a real system the significant frame-differences would be coded 
and transmitted to the receiver to update the contents of its memory 
as is done at the transmitter. But in the simulator we avoid the need 
for a transmission link and a receiver by taking the display signal 
from the input of the frame memory. Although we have avoided the 
need for transmission, we still need an indication of the fullness of the 
transmitting buffer in order to control the behavior of the coder as 
the buffer fills up. This is accomplished by counting the number of 
bits that would be sent to a buffer in a real system, and subtracting 
the amount that would be transmitted. The circuit labeled “bit assign- 
ment” determines the number of bits that should be fed to the counter 
in order to simulate a buffer. It receives the quantized differences, and 
determines the number of bits needed to code them. It also receives 
signals from the control circuit which mark the start and end of each 
cluster of changing elements and a signal to indicate when forced up- 
dating occurs. From the address generator it receives signals that mark 
the start of each scanning line. 

A count of the number of bits in the buffer is fed to the control 
circuit where it is used to determine the operating mode of the coder. 
The diagram in Fig. 18 illustrates this control function; it shows that 
the threshold is raised from four to five, and then to six, and finally 
to seven as the count increases from 10,000 to 20,000, to 35,000, and 
to 50,000. At a count of 65,000 all replenishment is stopped until the 
count falls below 2,500. The system starts subsampling when the count 
exceeds 20,000 and does not return to full sampling until the count 
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Fig. 18—A representation of the control functions associated with the buffer. 


falls to 10,000. * The subsampling is accomplished by simultaneously 
opening switch B and moving switch C to its lower position for every 
other element in the cluster. The signal fed to the memory and to the 
output is then an interpolation of adjacent values. 

Switch A is closed to bypass the quantizer and switch C removes 
the interpolation when lines of the picture are ‘force updated” with 
their true values. 


XI. BEHAVIOR OF THE CODER 


11.1 Qualitative Behavior 

When the simulator processes the signal in a Picturephone trans- 
mission, very little impairment can be seen in the received picture. 
Indeed the received picture appears to be the same as the transmitted 
one except for two conditions which rarely occur for normal scenes. 
One is the jerkiness introduced into scenes that change violently over 
a large area; the other is a moiré pattern that can be seen when a 
graticule of vertical bars moves enough to require the subsampling 
mode. The moiré pattern, or aliasing, is a transient that vanishes as 
soon as the graticule becomes stationary and full sampling resumes. 


11.2: Quantitative Behavior 

Figure 19a shows the probability of buffer content exceeding Q 
bits with a subject moving vigorously. Figure 19b shows the cor- 
responding probability density function. We see that, most of the 
time, the buffer is almost empty; only two percent of the time is it 
more than three quarters full. This indicates that there is considerable 
advantage in sharing buffers and transmission channels amongst sev- 
eral coders. Figure 20 shows how many bits are used, on the average, 


* Choice of these parameters is not critical. 
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Fig. 19a—The probability of the buffer content exceeding Q bits. 
Fig. 19>—The corresponding probability density function. 
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Fig. 20—The average number of bits transmitted plotted against the nuuseet 
of significant changes. The probability of the changes occurring is given in Fig. 2 
The ordinate of (a) represents the fixed transmission. The separation (a)- (b) 
represents the addresses, c.f. Fig. 11; (b)-(c) represents the amplitude difference ; 
and (c)-(d) represents the forced transmission. 


for the various functions of the coder at different levels of activity in 
the scene. The ordinate of the straight line (a) represents the fixed 
transmission. The distance between curves (a) and (b) represents 
the bits used for addressing clusters of changed elements, and the 
distance between (b) and (c) represents the bits used for signaling 
their changes of amplitude. The change in slope of (c) near 12 kilo- 
changes per frame and the subsequent lower rate of rise of the curve 
is caused by the subsampling mechanism. The distance between curves 
(c) and (d) represents the bits that are generated by forcibly updating 
lines of the picture in order to prevent the buffer from emptying. On 
this graph we see that the data generated per frame is less than the 
transmission rate, provided no more than 12,000 elements change in a 
frame time; i.e., 90 percent of the time. When more than 12,000 
elements change, the extra bits accumulate in the buffer. Only when 
more than 20,000 elements change for several consecutive frames, 
ie., for very violent motion covering a large area of the picture, is the 
overload mode needed. 


1916 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


XII. CONCLUSION 


Taking 8-bit PCM as the standard of coding quality, we have 
described a method of improving transmission efficiency. by a factor 
of nearly eight, permitting pictures to be transmitted at an average 
rate of one bit per element. Most viewers agree that there is little 
visible difference between the processed picture and the original. The 
quality is certainly better than has been obtained using 16-level 
differential coding. 

The improvement in efficiency is largely obtained by eee the 
use of transthission capacity for elements that do not change from 
frame to frame. Additional saving is obtained by taking advantage 
of the fact that reduced resolution is tolerable in changing scenes. Thus, 
frame-to-frame differences are quantized in amplitude, and sampled 
at half of the Nyquist rate. In this way the number of bits generated 
in every field is kept approximately equal to that which can be trans- 
mitted in a field time. A buffer is used to smooth the data flow over 
a field-time, its fullness serving as an indication of the rate at which 
the data is being generated. This measure is used for controlling the 
behavior. of the coder. 

Important subjects which have not been described here are the 
possibilities of using line-to-line correlation in the coder, the ad- 
vantages of mixing the data from several coders in order to obtain a 
more even data flow, and the effect of having an input signal that has 
already been modulated or coded in a prior transmission. 
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Low-Loss Microstrip Filters Developed 
by Frequency Scaling 


By W. W. SNELL, JR. 
(Manuscript received January 21, 1971) 


Low-pass and band-pass microstrip filters up to 830 GHz have been 
built on silica substrates by photolithographic reduction of optimized 
low-frequency models. Scaling accuracies of 1.8 percent or better have 
been achieved. An insertion loss of 0.28 dB has been measured for a 
five-section low-pass filter with a cutoff frequency of 5.7 GHz. The 
insertion loss of a 12-percent band-pass filter at 8.18 GHz is 0.16 dB. 
Good agreement between measured and calculated losses has been 
obtained. Circuit dimensions are given to facilitate scaling of the low- 
pass and band-pass microstrip filters to other frequencies. 


I. INTRODUCTION 


Microstrip filters are used in frequency converters, frequency multi- 
pliers, and branching networks for RF systems. Microstrip filters have 
been built in the past with photo-etched conductor patterns on ceramic 
substrates such as alumina, sapphire, and beryllia.1? These materials 
are useful in microwave integrated circuits at a few gigahertz where 
a desirable size reduction relative to free-space wavelength is achieved 
because of the high dielectric constant of the substrate. At higher 
frequencies, size reduction is a disadvantage because the circuit ele- 
ments are approaching the limitations of photolithographic reproduc- 
tion. A suitable substrate material for use at higher frequencies is 
clear fused silica. Its relative dielectric constant over a wide fre- 
quency range is 3.82 and its dielectric Q is 3,000 or higher at all fre- 
quencies up to 50 GHz. 

The purpose of this paper is to show that optimized microstrip 
filters at microwave and millimeter-wave frequencies can be built by 
linear reduction of a low-frequency model. The flow chart of this 
procedure is shown in Fig. 1. The oversize microstrip filter is designed, 
built, and tested at a few hundred megahertz. All circuit dimensions 
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Fig. 1—MIC flow chart (general). 












are then reduced N times in order to obtain the optimized microstrip 
filter at microwave or millimeter-wave frequencies. 

The initial design and testing of the microstrip filters described in 
this paper are made at 285 MHz for the low-pass filter and 818 MHz 
for the band-pass filter. The filters are scaled by factors of 20 and 10 
times, respectively, to produce 5.7-GHz and 8.18-GHz circuits. A 
scaling factor of 37.5 times is also used to obtain low-pass input and 
band-pass output filters for a 10.1-GHz to 30.3-GHz frequency multi- 
plier which is described in another paper.® 


II. MICROSTRIP SUBSTRATE 


Fused silica, Si02, is selected as the microstrip substrate for its 
desirable properties. These properties are as follows: 


(t) The dielectric constant and loss tangent are low. 
(iz) The dielectric constant is independent of frequency up to 50 
GHz. 
(171) The surface can be highly polished. 
(iv) It is commercially available in a wide range of sizes and thick- 
nesses. 
(v) It has a low thermal expansion coefficient. 


Table I lists the electrical and physical properties of fused silica 
and other common dielectric materials. Soda-lime glass, although not 
normally used as a microstrip substrate, is included as a low-cost 
material for price comparison. Detailed data on dielectric constant, 
loss tangent, and volume resistivity of high-purity synthetic fused 
silica are given in the appendix. Since silica has a low thermal con- 
ductivity of 14 xX 10° watts/em°K it is recommended for low- or 


medium-power applications. 
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III. SCALING OF MICROSTRIP CIRCUITS 


The laws governing scaling of electromagnetic circuits are described 
by Stratton. In order to obtain circuits with similar characteristics at 
different frequencies, each with a characteristic length J, an RF fre- 
quency o, a conductivity o, a permeability mw, and a permittivity «, it is 
necessary and sufficient that: 

pe(wl)” = K,, (1) 

peowl = Ky, (2) 
where K, and Kz are constants. Scaling has not been widely used for 
building microstrip circuits because high dielectric constant materials 
are expensive and difficult to obtain in large sizes and also because air 
gaps between these materials and metal conductors have a significant 
effect on the circuit performance and are difficult to scale. These prob- 
lems are substantially reduced by using a low dielectric constant 
material such as silica. 

A detailed flow chart of the scaling process is shown in Fig. 2. All 
circuit dimensions are reduced by the same factor by which the fre- 
quency is increased, thus satisfying equation (1). Equation (2) is only 
partially satisfied because brass conductors have been used in the over- 


TABLE I—PrRopPERTIES OF MIC SusstRAtTEs 






































Soda-Lime 
Alumina Glass 
Silica Al,O3 Beryllia Corning 
Property Units SiO2 (99.5%) BeO(99.5%) #0080 
Dielectric Con- 
stant at 10 
GHz, 25°C 3.82 9.8 6.8 6.71 
Loss Tangent 
10 GHz x10-4 1 4 3 170 
Thermal | 
Expansion PPM/°C 0.35 6.6 7.5 9.2 
Relative Cost 
Factor* 6 15 125 1 
Surface 
Condition Polished Polished Polished Drawn 











* Relative cost factor of substrate material compared to drawn soda-lime glass = 1 
(Data from Handbook of Thin Film Technology, edited by L. I. Maissel and R. Glang, 
McGraw-Hill, 1970, Section 6-43). 
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Fig. 2—MIC flow chart. (detailed). 





size model and gold conductors are used in the reduced size circuits. 
This leads to a slight increase in the scaled circuit losses because the 
skin effect loss increases with the square root of the frequency. 


IV. LOW-PASS FILTER 


The low-pass filter is a five-element network of three shunt ca- 
pacitors, C,, Cs, and Cs, and two series inductors, Lo and Ly, de- 
signed for a 0.2-dB Chebyshev frequency response. The microstrip 
layout of the filter is shown in Fig. 3. The total filter length is slightly 
less than A,/5; A, is the free-space wavelength at the 0.2-dB cutoff 
frequency. The length of each element is less than one-eighth of a 
microstrip wavelength, which allows the initial design of the filter to 
be based on a lumped circuit concept. The filter parameters C, and L, 
are computed from normalized prototype parameters g,, the input 
impedance R = 500, and the cutoff frequency wo, = 2zf-: 


1 

C, = wR Jn » (3) 
R 

L, = We ae (4) 


Table II lists the parameters of the filter. The 0.2-dB cutoff fre- 
quency is 285 MHz. The thickness of the diclectric substrate is 0.5 
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Fig. 3—Low-pass filter (circuit). 


inch and its relative dielectric constant is 3.82. The computed filter 
area is based on the choice of a specific cutoff frequency and does not 
take into account distributed line effects at higher frequencies. The 
experimental filter has been modified in order to optimize the per- 
formance up to 4, . 

The computed dimensions of the capacitive plates shown in Fig. 3 
are found as follows: 


TaBLE I]—Low-Pass Fitter PARAMETERS 


Design Computed Filter Experimental 
Parameters Dimensions Filter Dimensions 
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Aspect A in.? or In- 

Ch In Ratio ductance Area 
n Jn pF nH a/b Length J, in. ain. | bin. in.? 
1 | 1.339 | 15 | 3.59 Aj 8.07 1.42/50 | 7.1 
2 | 1.337 37.4 — hh = 1.82 0.70 | 1.62 
3 2.166 | 24.2 2.91 A; = 10.23 1.72 | 5.0 8.6 
4 1.337 37.4 l, = 1.82 0.70 | 1.62 | 
5 1.339 |} 15 3.52 As = 5.67 1.42 | 5.0 7.1 
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(t) An effective area A.,;, is defined as the area of a capacitor plate 
which has no fringing fields. This area is given by 


Assy ae iy (5) 


dielectric substrate thickness =0.5 inch, 

relative dielectric constant, and 

8.85 pF/m. 

(it) The actual area A of the capacitor plate is smaller because of 


fringe field effects. If a and b are the width and length of the capacitor 
plate shown in Fig. 3, then A.,, is given by 


Ais; = (a@ + at)(b + at), (6) 


where @ is a normalized factor which describes the effect of the fringe 
field. 

(iz) The aspect ratio of the capacitor plate shown in a 3 is defined 
as r = a/b and the factor B by 


where it 
€, 
Eo 


B= Vit Ye (7) 


r 


(w) The area A is obtained from equations (5), (6), and (7): 


= Cat (3° _ jee) _ api 
ae ({ “ 4 2 ®) 
One finds by empirical methods that the effective increase in each 


plate dimension due to the fringe field is approximately equal to the 
substrate thickness t; thus, a = 1 and equation (8) is simplified to 


sn f((ere — | of - 


Vv. MICROSTRIP INDUCTORS 


The inductance L per unit length for a narrow microstrip conductor 
with a width w and a substrate thickness h is 


60 In (3 am *) 
| iar penn aL , (10) 
UV, 
where v, is the velocity of light in free space, v, = 3 X 10° cm/s = 


1.18 x 10*° ips. The length 1, of the inductor element is given by the 
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ratio L,,/L where L, is the required inductance given in Table II, 


je ee =, (11) 
z 60w, In (3 ae | | 
vs w. 4h 


The computed length for the inductive elements is 1.82 inches. The 
actual length of the final inductive elements in the fiiter is 1.61 inches 
because the current constrictions in the capacitor plates increase the 
effective length of each element. . 

Calculated and measured transmission loss for a low-pass filter 
with a cutoff frequency of 285 MHz is shown in Fig. 4. The transmis- 
sion is plotted as a function of normalized frequency w/,. Figure 5 
shows the frequency response after scaling the filter by a factor of 
20X. The new cutoff frequency is 5.6 GHz and the scaling accuracy is 
1.8 percent. . 
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Fig. 4—Low-pass filter (comparison of calculated and measured data points). 
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Fig. 5—5-element low-pass filter (results of scaling from 285 MHz to 5.7 GHz). 


VI. BAND-PASS MICROSTRIP FILTER 


Band-pass filters utilizing two parallel coupled 4/2 resonators have 
been designed, built, and optimized at 820 MHz and scaled to 8.2 
GHz and 30 GHz. Figure 6 shows the microstrip lay-out of the con- 
ductor pattern on the silica substrate. The filter has a 0.2-dB 
Chebyshev ripple and 20 dB attenuation at the +33-percent band- 
width points. The calculated and measured filter responses are shown 
in Fig. 7 and ‘the filter characteristics, after scaling by a factor of 
10X, are plotted in Fig. 8. The scaling accuracy for the midband fre- 
quency is better than 1 percent. The 7-percent decrease in bandwidth 
is due to undercutting of the conductor elements during processing of 
the microstrip pattern. Undercutting is etching of the conductor pat- 


tern beneath the edge of the protective photoresist layer. It decreases 
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Fig. 6—Band-pass filter (circuit). 


all conductor dimensions by a small distance. The effect of under- 
cutting is most noticed on small critical circuit dimensions such as the 
0.008-inch-wide coupling gap of the 4/2 resonator. Undercutting can 
be controlled by establishing the undercut dimension and adding this 
to all circuit elements on the original artmaster. 
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sue 7—Experimental and theoretical attenuation of parallel coupled band-pass 
ter. 
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Fig. 8—Microstrip band-pass filter (results of scaling from 800 MHz to 8 GHz). 


VII. ATTENUATION OF MICROSTRIP FILTERS 


The dissipative loss in the pass band of a multiple-resonator filter is® 





_ —10_ ys 9: 

Las w’ In 10 > Q; dB, (12) 
where w” is the fractional bandwidth of the filter, g; is the normalized 
prototype filter parameter, and Q; is the unloaded Q of each filter 
element. 

The unloaded Q; of a microstrip element is 


207 ] 
eae 10 aro’ 








(18) 


where A, is the free-space wavelength, and a, is the free-space attenu- 
ation in dB per unit length as derived by M. V. Schneider,® where 
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10R, (® 2) er ey 
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or 


SCALED HYBRID INTEGRATED CIRCUITS 








1929 


_ __ ZR, 0.44h? ny | w au) 
a = aay | 1 + w co er Per ary, 


h 
. w/h < 1. 
Z, is the characteristic impedance of the microstrip line: 
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Z, = 60 In (8 +2) w/hS1 
Ge ae 
w 5h 
r w 
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(16) 


(17) 


R, is the skin resistance of the metal conductor as shown in Fig. 9, h 
is the conductor height, and w is the conductor width. The partial 


derivative dw/dt is given by: 


ow 1, 4rw i) 
polly ge aris secu < — 
ot In t w/h 20 
ow re, ] 
—_- = = —— w > — 
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(18) 


(19) 


where ¢ is the conductor thickness. The calculated and measured dis- 
sipative losses for the low-pass filter and for the band-pass filter are 


shown in Table III. 
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Fig. 9—Skin resistance of metals. 
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TaBLE IJJ—INsERTION Loss MEASUREMENTS 


Insertion Loss in dB 


Filter Calculated Measured 








Loss-Pass 
f-. = 5.7 GHz 
5 sections 0.13 0.28 


Band-Pass ain (ie 


2 sections 0.11 0.16 


Unloaded Q of Single Band-Pass Resonator 
Calculated Q 546 
Measured Q 375 


The calculated loss and the calculated Q factor are based on the 
assumption that the conductor resistivity is independent of frequency 
and equal to the de resistivity of the metal. 


VIII. CONCLUSION 


It is shown that scaling is a powerful tool for building and optimiz- 
ing microwave integrated circuits. A five-element low-pass filter and a 
two-element band-pass filter have been built and measured in the 
200- to 800-MHz frequency band. The low-pass filter has been scaled 
to 5.7 GHz and 10 GHz, and the band-pass filter has been scaled to 
8.2 GHz and 30 GHz. Good agreement between measured and calcu- 
lated losses has been obtained. Unloaded Q factors of 375 at 8.2 GHz 
are reported. 
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APPENDIX 


Properties of High-Purity Synthetic Fused Silica 

Table IV gives detailed data on the dielectric constant, loss tangent, 
and volume resistivity of high-purity synthetic fused silica produced 
by chemical vapor deposition (Dynasil No. 4000). 
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TaBLe IV—MIT Measurements or Dynasit No. 4000 (Juny 1970) 


Dielectric 





Constant 100 Hz 10 kHz 1 MHz | 10 MHz |8.60 GHz |24.0 GHz 
25°C 3.826 3.826 3.826 3.826 | 3.824 3.82 
100°C 3.834 3.834 3.834 3.834 | 3.832 3.83 
200°C 3.843 3.843 3.843 3.843 | 3.841 3.84 
300°C 3.861 3.856 3.854 3.854 | 3.852 3.85 
400°C 4.043 3.870 3.865 3.865 | 3.863 3.86 
500°C 4.795 3.886 3.878 3.878 | 3.876 3.87 
Loss 
Tangent 
25°C < .000004; <.000005 .000015 .00002; .00012 .00033 
100°C < .000004| <.00002 | <.00005 | <.0001 .00012 .00032 
200°C .00137 .00003 | <.00005 | <.0001 .00012 .00026 
300°C .0930 .00093 | <.00005 | <.0001 .00012 .00025 
400°C 2.03 0207 .00021 | <.0001 .00012 .00022 
500°C 11.36 .140 .00140 .00014| .00012 .00020 
Volume logio p 
Resistivity | ohm-cm 
25°C >15.0 >12.9 10.5 9.34 5.65 
100°C >15.0 >12.4 >9.95 | 8.65 5.65 
200°C 12.5 12.2 >9.95 >8 .65 5.65 
300°C 10.9 10.9 >9.95 >8.65 5.65 
400°C 9.34 9.34 9.34 >8 .65 5.65 
500°C 8.52 8.52 8.52 8.52 5.65 


Source: os von Hippel and W. Westphal, Laboratory of Insulation Research, 
T. 
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A Scaled Hybrid Integrated Multiplier 
from 10 to 30 GHz 


By M. V. SCHNEIDER and W. W. SNELL, JR. 


(Manuscript received February 18, 1971) 


Frequency multipliers which are built on a dielectric substrate have 
many useful applications in radio systems. The multiplier circuits 
which include input and output filters, idler circuits, and matching 
networks can be produced by integrated circuit processing techniques. 
Varactor diodes, also made by batch processing, can be bonded to the 
metallized substrate. 

A hybrid integrated three-times frequency multiplier from 10 to 30 
GHz has been designed and built on a silica substrate by using fre- 
quency scaling and integrated circuit processing techniques. The multi- 
plier produces a CW output power of 50 mW for a pump power of 178 
mW with an overall efficiency of 29 percent. The maximum output 
power 1s 150 mW. Other harmonic generators with different output 
frequencies can be built by scaling the extsting circuit. 


I. INTRODUCTION 


Frequency multipliers are used in radio systems for downconverters, 
for upconverters, or for locking solid state oscillators. A typical exam- 
ple is the multiplier chain used in the short hop radio system experi- 
ment+ which delivers 100 mW at 10.46 GHz for the transmitter and 
10 mW at 10.66 GHz for the receiver. Similar systems above 10 GHz 
for common carrier applications will require a few tens of milliwatts 
of transmitter power? and approximately 10 mW for the receiving 
downconverter. A compact and low-cost repeater package can be manu- 
factured by using integrated circuit technology for building components 
and subassemblies. This can be achieved by using low-loss dielectric 
substrates, fully shielded circuits, and a circuit design which does not 
require high Q factors. 

All three basic requirements are fulfilled for the frequency multi- 


1933 
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plier described in this paper. The dielectric substrate is silica, the cir- 
cuit is fully shielded, and low-Q filters are used for separating the 
pump frequency and the output frequency. The pump frequency is 
compatible with the output of existing high-efficiency multipliers used 
in the short hop system experiment.1 The substrate metallization as 
well as the planar diffused GaAs varactor diodes are fabricated by 
using integrated circuit processing steps. Optimum dimensions of all 
conductor patterns are obtained by building and testing an oversize 
model of the complete multiplier at. a lower frequency, that is, with a 
pump frequency of 270 MHz and an output frequency of 810 MHz. 
Circuit adjustments are easily made in the oversize multiplier and no 
adjustments are necessary after the oversize model is reduced to its 
final size. Good stability and spurious-free operation is obtained for the 
oversize multiplier with an 80 percent efficiency and a power output of 
2.5 W at 818 MHz. 


II. DESIGN AND FABRICATION OF INTEGRATED MULTIPLIERS 


Frequency multipliers have been built in the past with coaxial and 
waveguide circuits by using mesa type,® or planar. diffused, Si or 
GaAs‘ varactor diodes with the highest possible figure of merit. The 
basic design principles for building multipliers are well known’ and 
overall efficiencies close to the theoretically expected values have been 
achieved up to X-band frequencies. Some effort has been made in 
building high-performance hybrid integrated multipliers up to X-band, 
but the design of good filters on commonly used alumina substrates is 
difficult at millimeter-wave frequencies and also external tuning ele- 
ments are usually needed to achieve good overall efficiency. This dif- 
ficulty can be resolved by using a high-quality dielectric substrate with 
a relatively low dielectric constant and a simple conductor pattern 
which can be clearly separated into a low-pass input filter, an idler 
circuit, and a band-pass filter. Before scaling, the filter characteristics 
and the performance of the complete multiplier are tested in an over- 
size model. The model is assembled with clear silica plates with a 
relative dielectric constant e«, = 3.8. Small air gaps in the oversize 
model between the ground plane and the dielectric substrate and air 
gaps between the conductor pattern and the substrate are not critical 
because of the low dielectric constant of silica. The junction capaci- 
tance and the dimensions of the varactor diode are also scaled by the 
same reduction factor which is used for scaling the microstrip circuit. 
The parameters which are more difficult to scale are the cutoff fre- 
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quency of the diode and the skin depth of the microstrip conductors. 
This problem has been reduced by using a low-cutoff silicon diode for 
the low-frequency model and a high-cutoff gallium arsenide diode in 
the final circuit at 30 GHz. The skin depth is partially scaled by using 
brass conductors for the low-frequency model and gold conductors 
for the final circuit. 


21 Multiplier Conductor Pattern 

The complete conductor pattern of the multiplier deposited on a 
silica substrate is shown in Fig. 1. The pattern consists of three sec- 
tions, a low-pass input filter, an idler section, and a band-pass output 
filter. The input and the output is a microstrip line with a character- 
istic Impedance of 50 ohms. The substrate is inserted into a channel in 
a brass block, shown in Fig. 2, with a coaxial-to-microstrip transition 
at the input and a microstrip-to-waveguide transition at the output. 
A planar diffused GaAs varactor diode is mounted between the sub- 
strate metallization and a metal tab which is in contact with one side- 
wall of the channel shown in Fig. 2. The multiplier is completely 
shielded by a metal plate which covers the brass channel. 


ic GHZ, 502 MICROSTRIP INPUT 
i! GHZ LOW-PASS FILTER 


le GHZ IDLER CIRCUIT 


i> VARACTOR DIODE CONTACT AREA 


30 GHZ BAND-PASS FILTER 


30 GHZ, 502 MICROSTRIP OUTPUT 





Fig. 1—Conductor pattern of hybrid integrated multiplier on silica substrate. 
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Fig. 2—Hybrid integrated multiplier from 10 GHz to 30 GHz with transitions 
from coaxial transmission line to microstrip, and microstrip to RG-96/U 
waveguide. 


2.2 Low-Pass and Band-Pass Filter Design 


The low-pass filter used in the multiplier is a five element semi- 
lumped structure consisting of three capacitive and two inductive sec- 
tions. The element values for a Chebyshev filter with a 0.2-dB ripple 
are computed from tables given by G. L. Matthaei.? The cutoff fre- 
quency is 8300 MHz for the oversize model and 11.2 GHz for the scaled 
filter. The attenuation of the filter at the second harmonic of the pump 
frequency, which is the idler frequency of the multiplier, is 35 dB. 
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The band-pass filter consists of two parallel-coupled microstrip res- 
onators. The 3-dB bandwidth of the oversize filter with a center fre- 
quency of 810 MHz is 160 MHz. The 3-dB bandwidth of the reduced 
filter with a center frequency of 30.4 GHz is 6.0 GHz. The attenuation 
of the filter at the second harmonic of the pump frequency is 25 dB. 
The scaling of microstrip circuits, the design of the filters, and the 
measured characteristics of low-pass and band-pass microstrip filters 
on silica are described in more detail in a separate paper.® 

The input and output impedance for each filter is 50 ohms. An ad- 
ditional matching network or a modification of each filter is necessary 
to match the input impedance Zin, and the output impedance Z,., of 
the multiplier. Zin and Zou; are functions of the pump frequency w, , the 
nonlinearity coefficient y, the breakdown capacitance C’; , and the drive 
D of the varactor diode. For a tripler with f, = o,/27 = 10 GHz, a 
nonlinearity coefficient y = 0.38, a breakdown capacitance Cz = 0.090 
pF, and a drive D = 1.6, one obtains 


Fily, D) _ 0.214 


HERS Ge a (1) 
= F(y, D) 7 0.087 _ 
7 i = on Ga 15.5 Q, (2) 


where the values of the functions F,(y, D) and F.(y, D) are obtained 
from tables given by C. B. Burckhardt.® The transformation from 38 
ohms to 50 ohms at the input is achieved by a quarter-wave microstrip 
section with an impedance of Z = W50Z;, = 43.5 Q. This section is also 
part of the idler circuit shown in Fig. 1. Its electrical length at the idler 
frequency is slightly longer than half a wavelength and its inductive 
reactance seen across the varactor terminal resonates with the average 
diode capacitance at the idler frequency. A short additional line section 
is required to tune the reactive part of the diode package. 

The transformation from 15.5 ohms to 50 ohms is obtained by ad- 
justing the coupling between the contact area of the varactor diode and 
the adjacent half-wavelength microstrip resonator. The spacing between 
the conductor edges is optimized in the oversize model of the multiplier 
and no adjustment is necessary after the model is reduced to its final 
size. 


2.3 Diode Fabrication and Packaging 
The planar diode used for the integrated multiplier is a zinc-diffused 
junction in n-type epitaxial gallium arsenide with a doping level of 
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2 X 10” carriers/ce and an epitaxial layer thickness of 1.2 um. The 
processing is done by the photoresist, etching, and zinc-diffusion proc- 
ess developed by C. A. Burrus’? for fabricating planar millimeter-wave 
varactor diodes with zero-bias cutoff frequencies above 500 GHz for 
breakdown voltages between 10 and 20 volts. Contact to the gold- 
plated junction with a diameter of 20 um is made with a gold wire 
with a diameter of 75 um and a length of 150 um. The gold wire is 
embedded in epoxy and the diode is mounted in the circuit between the 
gold-plated contact area on the silica substrate and a spring-loaded 
ground contact made with a gold-plated phosphor bronze tab which is 
shown in Fig. 2. The diodes can be easily replaced and the mount has 
the smallest possible series inductance which can be achieved without 
making a hole in the silica substrate. 


Ill. PERFORMANCE OF HYBRID INTEGRATED MULTIPLIER 


The microstrip and diode properties used for building the hybrid 
integrated multiplier are listed in Table I. The output power of this 
multiplier is 50 mW at 30.38 GHz for a pump power of 178 mW at 10.1 
GHz. The available output power at burnout is 150 mW. A plot of the 
output power as a function of pump power is shown in Fig. 3 and spe- 
cific performance data are listed in Table II. 

The output is measured in RG-96/U rectangular waveguide by using 
a microstrip-to-waveguide transition which is similar to the side 
launcher developed by K. H. Knerr.* The only major difference is that 
the Knerr launcher is used for balanced microstrip line while the pres- 


TABLE I—SUBSTRATE AND DIODE PROPERTIES 


Substrate Material Silica 


Dielectric Constant 
Loss Tangent 
Substrate Thickness 
Substrate Dimensions 
Substrate Metallization 


Weight of Gold 
Varactor Diode 


Epitaxial Layer Thickness 
Doping Level 

Mobility and Resistivity 
Properties of N* Material 
Junction Diameter 

Wafer Thickness 

Wafer Dimensions 


er = 3.828 + 0.003 at 8.5 GHz 

104: tan 6 = 1.29 + 0.2 at 8.5 GHz 

0.34 mm 

4.06 mm X 20.4 mm - 

100 A Nichrome and 1000 A Gold 
evaporated, 3 um Gold electroplated 

4.7 milligrams (0.57 cent) 

Planar zinc-diffused junction into epitaxial 
GaAs N/Nt 

1.2 um 

1.9 X 10" carriers/cc 

4290 cm?2/Vsec, 0.0076 Qcm 

Orientation (100), Te doped, 1x 107-3 Qcm 
20 wm 

0.20 mm 

0.35 X 0.35 mm 
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Fig. 3—Output power of multiplier at 30.8 GHz as a function of pump power 
at 10.1 GHz. 


TABLE II—INTEGRATED FREQUENCY MULTIPLIER PERFORMANCE 





Input Frequency 10.1 GHz 

Input Power 178 mW 

Output Frequency 30.3 GHz 

Output Power 50 mW 

Efficiency 29% 

DC-Bias Self-bias, 6 mA at 3V 

Capacitance at Zero Bias 0.15 pF 

Capacitance at Breakdown 0.09 pF 

Breakdown Voltage 15 volts 

Input Microstrip Filter Five element semi-lumped low-pass filter 


with three capacitive and two inductive 
sections, cutoff frequency 11.2 GHz 

Output Microstrip Filter Band-pass filter with two parallel-coupled 
microstrip resonators, center frequency 
30.4 GHz, 3-dB bandwidth 6 GHz 


Input of Multiplier Coaxial-to-microstrip transition, 50-ohm 
; coax to 50-ohm microstrip 
Output of Multiplier Microstrip-to-waveguide transition, 50-ohm 


microstrip to RG-96/U waveguide 
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ent launcher is earned for a standard microstrip line on a silica 
substrate. 

The tuned output power versus saibngy for a pump power of 137 
mW is plotted in Fig. 4. The 3-dB bandwidth is 2.5 GHz or 8.5 percent 
and has been achieved by optimizing both diode bias voltage and 
plunger position of the microstrip-to-waveguide transisition at each fre- 
quency. The maximum output power is obtained at 30.3 GHz and is 
related to the fixed frequency resonance of the idler which occurs at 
20.2 GHz. The corresponding oversize multiplier is optimized for an 
input frequency of 272.7 MHz and an output at 818 MHz. .The scaling 
factor used for building the multiplier shown in Fig. 2 is 37.5 times 
and the optimized output after reduction of the circuit should be 
reached at an output frequency of 30.7 GHz. This means that the total 
error in scaling is 1.3 percent. The error is very small because the pho- 
tolithographic technique is one of the most accurate methods to reduce 
the physical dimensions of a microstrip circuit and also because the 
dielectric constant of the substrate does not change with frequency.?? 

Figure 5 shows the frequency spectrum of the multiplier from 29.3 
to 31.3 GHz. The output is stable and free of spurious signals to the 
maximum sensitivity of the spectrum analyzer which is 40 dB below 
the output of the multiplier. Good stability and spurious-free opera- 
tion are also obtained for the oversize multiplier. with an 80-percent 
efficiency and a power output of 2.5 W at 818 MHz. 


Po ee Ld 


Simm 


| PUMP PoweR=137MW —\ POWER = 137 | PUMP PoweR=137MW —\ 


OUTPUT POWER IN MW 





OUTPUT FREQUENCY IN GHZ 


Fig. 4—Tuned output power versus frequency of multiplier for fixed pump 
power of 137 mW. 
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Fig. 5—Output spectrum of multiplier from 29.3 to 31.3 GHz. 


IV. CONCLUSIONS 


Hybrid integrated multipliers on a dielectric substrate can now be 
built with output frequencies up to 830 GHz. The power levels and the 
overall efficiencies which are obtained with a single planar diffused 
gallium arsenide diode are useful for many future millimeter-wave 
systems applications.?"* Scaling of oversize circuits is a powerful 
technique for building optimized integrated millimeter-wave circuits. 
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Some Analytical Investigations of 
Phase Contrast Imaging 


By C. M. NAGEL, JR. 
(Manuscript received December 1, 1970) 


An analytical study of phase contrast imaging is performed. It 1s 
shown that the complex disturbance in the image plane can be repre- 
sented as a convolution of the object disturbance and the Fourier 
transform. of the transmission function of the phase plate. The sumple 
theory of the phase contrast microscope is then derived as a lamiting 
case of this more general result that is applicable when the size of the 
phase object 1s small compared to the area of the entrance aperture of 
the system. The response of a general system to several simple large 
phase objects is also examined, and it is shown that qualitative infor- 
mation about these objects can be obtained from the intensity pattern 
when the phase perturbations are small, providing the background ts 
sufficiently uniform, and the size of the phase spot on the phase plate is 
carefully chosen. The study provides insight into the type of perform- 
ance that can be achieved, for example, if phase contrast imaging 1s 
used to extract phase information from an optical memory or if it 
is used as an experimental tool to study the qualitative behavior of 
such phenomena as clear air turbulence. 


I. INTRODUCTION 


One of the most popular techniques for converting a spatial phase 
variation into a spatial intensity variation is the phase contrast 
imaging scheme originally proposed by F. Zernike** in 1935. Pro- 
ceeding mainly from a heuristic point of view, Zernike recognized 
that, with a conventional imaging system, most of the direct light 
passes through a small region R in the focal plane while, in many 
instances, most of the diffracted light is scattered away from this 
region. If a plate whose transmission function is given by 


T = " in R (1) 
1 otherwise 
1943 
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is placed in the focal plane, the direct light can be modified and made 
to interfere with the diffracted light to produce an intensity pattern 
in the image plane. Moreover, if the phase perturbations ¢ that are 
introduced by the object are small enough so that the approximation 


e* V1t+ jp 
holds, the intensity in the image plane and the phase of the object 
may be linearly related. These concepts led to the design of the phase 
contrast microscope. 

The key assumption in the theory is that the effect of the phase 
plate on the diffracted light is negligible. In 1958, H. H. Hopkins‘ 
rigorously demonstrated that, if this assumption holds, a phase object 
will produce an intensity pattern given by 


I= ta \ad*— 1+" ee (2) 


In this expression, M is the magnification of the system, and it is 
assumed in the derivation that the object is illuminated by a unit 
amplitude monochromatic plane wave. When ¢ is small, a linear 
relationship, 


pe on la? Dad sine): (3) 


results, as predicted by Zernike.t 

There are many potential applications of phase contrast imaging 
in which the basic assumption employed in Refs. 1-5 will not be 
valid. Therefore, it is interesting to examine what results will follow 
if these assumptions are not made. In this regard, Hopkins® rigorously 
studied the diffraction images of circular disks under coherent illumi- 
nation, and M. De and §. C. Som,’ and De and P. K. Mondal,® in- 
vestigated the images produced by similar objects under incoherent 
illumination. In this paper, we will derive the theory from the point of 
view of Fourier optics and demonstrate that, in general, the complex 
disturbance over the image plane can be represented by a convolution 
of the object disturbance and the Fourier transform of the transmission 
function of the phase plate. The approximate theory, equations (2) 
and (3), is then shown to be a limiting case of the general theory, 
applicable if the size of the phase object is small compared to the 
entrance aperture of the system. Such is usually the case when the 
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magnification of the system is large. For larger objects, the convolution 
integral yields a complicated relationship between the phase of the 
object and the intensity of the image. 

The response of the system to several simple large phase objects is 
then studied, and it is shown that qualitative information about the 
phase distributions of these objects can be obtained when the phase 
perturbations are small if an appropriate phase plate is employed. 

The work that is discussed in this paper complements the experi- 
mental investigations carried on at Bell Telephone Laboratories by 
N. J. Kolettis. 


II. GENERAL THEORY 


The optical system that we will consider is shown in Fig. 1. A 
circular entrance aperture with radius R, is located a distance d, in 
front of the lens, and a phase plate with a transmission function 


ja 


ae* rr, SR; 


T (rs) = | 1 Rs a vy < Ryo (4) 


0 Ryo ee Vs 


is centered in the focal plane. The phase object is assumed to occupy 
some portion of the entrance aperture, and the illuminating light is 
taken to be a normally incident unit amplitude monochromatic plane 
wave. . 

Let the complex disturbance in the object plane be denoted by 
U(X, Yo) where U, is assumed to be zero at points outside the 
entrance aperture. Then, from the Fresnel approximation,® it follows 





ENTRANCE LENS PHASE IMAGE 
APERTURE PLATE PLANE 


U=Upo (Xo, Yo) U=U¢ (XF,UF) U= Ui (xi, Yi) 


Fig. 1—System geometry. 
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that the complex disturbance in the focal plane to the left of the 
phase plate is 


U;(a; , ys) = 2 MOLE oa ne exp |# (1 — ts) =f vi | 


: d, d, 
ff U(x. ) yoP (a. a $ Uy Yo + f us) 





—ik 
"exp |=# (ta, + vai | dx, dy, (5) 


In this expression f is the focal length of the lens, \ is the wavelength 
of the illuminating light, k is the wavenumber 27/,, and knA, is the 
phase shift introduced by the lens. P;, is the pupil function of the lens 
which assumes the value of unity over the region covered by the lens, 
and is zero elsewhere. 

Now, the complex disturbance on the right side of the phase plate 
is simply U;T;, and, therefore, if the Fresnel approximation is also 
used to describe the propagation from this plane to the image plane, 
the complex disturbance in the image plane becomes 


UO (a; } Yi) 


= 4 ie If. U (aa: yP (2, oF oo, Yo + fy re, > Ys) 


12 
“exp (2 (Mx, + x,)x, + (My, + wun} dx, dy, dx; dy; , (6) 


where 


= —oxp (ita, +7 +d +n.a)} exp | # et + v9 | 


_ Here we are assuming that the distances d, , d, and f obey the lens law, 


t(1-4)+4=0, 


and we have denoted the magnification of the system, d/f, by M. 
The integral in (6) is complicated by the coupling of the coordinates 
Lo, Xe and Y,, Ys Via the pupil function of the lens. This phenomenon, 
known as the vignetting effect,? can be ignored if it is assumed that 
the lens is large enough so that the sum of the squares of the argu- 
ments of P; is always less than the square of the radius of the lens 
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for all values of 2, 2, yo, and y; for which U, and 7; are nonzero.? 
With this assumption P; may be taken as one. 
To further simplify equation (6) we introduce the notation 


x; 
M } 
Then, neglecting the vignetting effect, 


“= 





§: =F oS De Bes Y= Yot ¥: . 


Ua; , yi) = sd | U(e — #:,y — 9) F(T;| dx dy 
where 


F(T,] = a T(x; , ys) exp | — eit (xa, + ww | dx; dys. (7) 
We observe that the disturbance in the image plane can be written 
as a convolution of the disturbance in the object plane and the Fourier 
transform of the transmission function of the phase plate evaluated 
at the spatial frequencies Mx/dd, My/nd. 
Let us now examine the form of F[7;] more carefully. The transmis- 
sion function of the phase plate, from (4), is given by 


T,(r;) = (ae’* — 1) cire (7,/R%) + cire (r;/R;.) 





with 
1 = e+ uD! 
and 


cire (r,/R) = : ny skh 
0 rr, > R. 


It follows then that 


2nMR; ) (22MRn 
a yams P| Mo MR et 


Te xd ; Ta - 














F [7 ;] = 


(8) 
where 
r=(@+y)}, 
and J, is the Bessel function of the first kind of order one. 


+ Recall U. = 0 for x2 + yo? > Ro, and T; = 0 for x72 + y;? > Bro. 
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Now, at optical wavelengths, R,,/d is exceedingly large, and for all 
practical purposes, the second term in (8) can be approximated by a 
two-dimensional delta function with little error.t On the other hand, R? 
is typically of the order of the radius of the Airy disk of the diffraction 
pattern of the entrance aperture so that the first term cannot be so 
simplified. To emphasize this fact, R? will be written as 


_ At 
= V7 R, 
where 7 is a constant of the order of unity. (y ~ 0.61 if R? is the radius 


of the Airy disk.) 
With this notation and the definition of M, equation (8) becomes 


fe 
2 BP ~ 1 ; 
FI) >| (ae'* — 1) 2 Ae + a0 |- 9) 


Returning to the expression for the complex disturbance in the image 
plane (7), we conclude that 


Ry 


K ja s a a 
Uae) El aes —) 2 ff ufe-%,y- 4) 


2 1 
[22 (e+ i | Hoe Yi 
Gig dx dy + u{—% — 4) - (10) 
Equation (10) is the fundamental result that will be employed through- 
out the remainder of this paper. 





III. SMALL PHASE OBJECTS-THE PHASE CONTRAST MICROSCOPE 


_ The form of equation (10) indicates that, in general, a complicated 
relationship exists between the object and the image. We will now 
demonstrate, however, that the simple theory, equations (2) and (3), 
may be employed zf the ratio of the area of the phase object to the 
area of the entrance aperture is sufficiently small. This is usually the 
case for a microscopic system. 

We will assume that the phase object is centered in the entrance 


+ We shall observe later that this approximation yields sharp jumps in intensity 
when the phase object possesses phase discontinuities. In practice some smooth- 
ing of the discontinuities in the intensity pattern will always result. (See Ref. 
10 for example.) We are neglecting these lower order effects for simplicity. 
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aperture and write U,(a,, yo) as 


sss (x? ais va id 
U (to , Yo.) = cire kez +€ 1)P, , 


where P, has a value of one over the region covered by the phase 
object and a value of zero elsewhere. In the limit of a very small 
phase object,* the contribution of the second term in U, to the integral 
in (10) approaches zero, and the integral can be approximated by 


2rY , 2 2\1 
Pa ~ \2 Pee Fe bere +y)}| 

a & (G@=— 4) +U—F%) ] [a 

i= R, If cire | R, (a? Pe da dy. 


The convolution is most easily evaluated by first taking the Fourier 
transform of I and then taking the inverse transform of the results. 
These two operations yield - 








if = rad se ) dt (11) 


where 7; is the radial coordinate in the image plane. 

If we now restrict the coordinates in the image plane to the region 
corresponding to the image of the phase object, the ratio 7,/MR, will 
be small, and I can be approximated by 


2r7 
iow / Fiat 27 Owy, 
0 


Finally, with an appropriate choice of the radius of the central spot 
of the phase plate (and thus y), the contribution from the Bessel 
function Jo(27y) can be made to vanish, and we conclude that, for a 
sufficiently small phase object and a suitable y, . 


K Fe : i a 
Ut; ,y) & Ti | ae — 1) + exp | io( —& ; _#)|]. 


The intensity of the image is then 


ja ‘ _ vi = Yi 
(ae 1) + exp El VW’ #) 


which agrees with equation (3). The simple theory of the phase con- 
trast microscope is, therefore, a limiting case of the more general 
approach. 


2 














1 
= |U, | Raa 
Ia; ’ Yi) =a | U; | M 


+ Consider a single cell under a microscope, for example. 
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IV. MACROSCOPIC SYSTEMS 


In many potential applications of phase contrast imaging the mag- 
nification of the system may be of the order of unity, and the area of 
the phase object may be comparable to the area of the entrance 
aperture. In these cases, the approximations that were employed in the 
last section cannot be used, and the complex disturbance in the image 
plane can, in general, only be determined via a direct application 
of equation (10). In order to provide some representative results, we 
have chosen a few simple, yet important, examples for which the 
integral in (10) can be easily evaluated. 


4.1 Background 


The purpose of any phase contrast system is to give a visual repre- 
sentation of the phase distribution in the entrance aperture. If a system 
is to yield meaningful results, therefore, the intensity of the image of 
the aperture should be essentially constant when no phase objects are 
present. In this case 


2 2\} 
Ufa; ’ Yo) = cire | ete ss Ye) | 


and from - and (11) it follows that 


2 








igs af (ae’* — 1) ie Oe a t) dt +1 (r; < MR,) 


(12) 


where the notation zg = 27y has been introduced. 

For our later purposes, it is convenient to define a normalized in- 
tensity, 7 = M’°I. Plots of this function are shown in Figs. 2-5 for 
various values of z. Clearly, the response of the system to the illumi- 
nating light is critically dependent upon the size of the central spot of 
the phase plate. Where a = 1 and a = 7/2, a typical case that is em- 
ployed in phase contrast imaging, near uniform backgrounds result 
for values of 2 corresponding to spot sizes on the order of one-half the 
Airy disk or smaller.’ For larger z, the background varies considerably 
until several rings in the diffraction pattern of the entrance aperture are 
covered by the phase spot.? We shall see later that these variations 
produce distortions in the intensity pattern of phase objects when they 


* The task of making a phase plate with a spot size this small may be difficult. 
+The phase contrast system will begin to behave like a direct imaging system 
for spot sizes larger than those shown, and thus large z values are not practical. 
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AIRY DISK 
-~ 7 = 3.83171 





Fig. 2—Background intensity distributions for spot sizes less than or equal to 
the Airy disk (a =1, a = 7/2). 


are placed in the aperture. The case a = 1, a = 7/2, 2 = 1.5 will be of 
special interest to us, since in this case the integral in (12) is approxi- 
mately 1/2 except when r; is near the edge of the image of the aperture, 
the background is essentially constant, and several interesting results 
can be obtained. 

Frequently, a phase contrast system will employ a partially absorbing 
phase plate (a < 1) to improve the contrast between the image of the 
phase object and the background [see equation (3)]. Background in- 
tensities for a = 0.1, a = 7/2 and various values of z are shown in 
Figs. 4 and 5. There it may be observed that, in general, the back- 
ground intensity is reduced, although a bright ring appears in the | 
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: it \ 


Fig. 3—Background intensity distributions for spot sizes larger than the Airy 
disk (a = 1, a = 7/2). 


region close to the edge of the image of the aperture. (This ring has been 
observed experimentally by Kolettis.) If this edge effect is ignored, near 
uniform backgrounds are obtained for values of z that include five or 
six Airy rings. 

Taken together, the above results indicate that care must be exercised 
in designing a phase contrast system if near uniform backgrounds are 
to be achieved. 


4.2 Phase Disk 
Consider the “phase disk’ defined by 


U (to, Yo.) = (’*® — 1) cire ee ear Ee -- He) 


OSRY SR) (18) 


where ¢ is a constant. The geometry corresponds to a disk with radius 
R* centered in the entrance aperture which introduces a constant 
phase shift ¢ in the incident wave. Substituting in equation (10) and 
using the technique that was employed to derive equation (11), we 
obtain the intensity distribution 


asniuy ity 
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(Ge = Diet =DINe) Elta e* |, 








qa) 
Tr) = ae id) 
eae Ie = Ie) 1G) aay, 
MR;” <r; S MR,. 
In this expression I(r;) is the integral given in (11) and 
RY 2ry Ro» : 
I(r, — PR. : At R aes ) at. (15) 


Note that the integral I only takes into account the finite extent of the 
entrance aperture, while I“ includes the size of the phase object as well. 

Plots of (14) are shown in Figs. 6-8 fora = 1, a = w/2 and z = 1.5. 
The value of z has been chosen to correspond to a near uniform back- 


AIRY DISK 





Fig. 4—Background intensity distributions for spot sizes less than or equal to 
the Airy disk (a = 0.1, a = 7/2). 
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_Fig. 5—Background intensity distributions for spot sizes larger than the Airy 
disk (a = 0.1, a = 7/2). 
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Fig. 6—Response to a phase disk (@ = 0.1, R.2)/R, = 0.1). 
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(0) 0.2 0.4 0.6 0.8 1.0 
Ti /MRo 


Fig. 7—Response to a phase disk (¢ = 0.1, 8, /R,. = 0.5). 


ground when R{” = R, or when R‘” = 0. It is clear from the figures 
that, for the small value of ¢ chosen, the qualitative behavior of the 
phase distribution in the entrance aperture is reproduced quite nicely. 
The phase disk can be readily observed and the magnitude of the dis- 
continuity in I is on the order of ¢.* 

In Fig. 9 we have indicated the response of the same system with 
the exception that ¢ has now been assigned a value of +. Note that in’ 
this case, the Jump in T depends critically upon the size of the phase 
object. Moreover, it appears that a value of R‘” exists for which no 
contrast results and beyond which the contrast is reversed. This sug- 
gests that for larger @ values, the system performs quite poorly. 

_ To explain this behavior, we need only to return to equation (14) 
from which it immediately follows that the magnitude of the dis- 
continuity in J is . 

AI = 2 Re {(ae'* — 1)(e* — 1[e* — II? (MRS) + IMR) ]} 

2(a cosa — 1)(1 — cos¢)(2I™’ — I) + 2alsinasing. (16) 


For small ¢, | 


Al & (2al sin a)¢, 


while for larger ¢, the full equation (16) must be employed. Recalling, 
that for z = 1.5, I = 0.5 for most values of r; , we conclude that, for 


+ The increase in intensity in the outer portion of the image is due to the slight 
nonuniformity of the background in this region. 
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Fig. 8—Response to a phase disk (¢ = 0.1, R.{)/R. = 08). 
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a= 1,a= 7/2,z2 = 1.5, and ¢ small, 
Al ~ @ 


as observed. 
When ¢ = 7m, however, equation (16) reduces to 


AI = 4(a cos a — 1)[21 (MR) — ICMR” )). 


From the definitions of the integrals (11) and (15), it is clear that for 
@ = 7m there exists a value of R‘” for which AJ = 0 and beyond which 
the contrast is reversed. 

In the preceding section, we observed that nonuniform backgrounds 
can result when the size of the central spot of the phase plate is not 
carefully chosen. Figure 10 demonstrates the response of a phase con- 
trast system to a phase disk when this is the case. The value of z has 
been chosen so that the spot size corresponds to the Airy disk of the 
diffraction pattern of the entrance aperture. The background corre- 
sponding to this value of z is shown in Tig. 2. Note that the phase 
disk simply perturbs the background pattern yielding an intensity pro- 
file that does not have the desired step function shape. 


4.3 Phase Rings 


The analytical results presented in Section 4.2 indicate that a dis- 
continuity in the radial coordinate of the phase distribution of the en- 
trance aperture can be qualitatively reproduced in the intensity pattern 
providing a sufficiently uniform background is chosen and the magni- 
tude of the discontinuity is small. In other cases, such as a large dis- 
continuity or a nonuniform background, rather poor results are obtained. 
We will now generalize these results to a more complicated radial phase 
distribution. 

Suppose the phase distribution in the entrance aperture consists of a 
central disk and a sequence of annular rings in each of which the phase 
is some constant ¢, . Let the radii defining the location of the phase 
discontinuities be R‘“”, k = 1,---,n —1,andlet R*”’ = R,. Then U, 
can be written as 


n 2 2\3 
Uslts sY.) = US cre [ ee | (17) 
where 
U® = a [7@.] — exp [Jbx+1] k=1,::-,n—-1 


exp [1¢,] k=n. 
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Fig. 10—Response to a phase disk when the phase oor spot size equals the Airy 
disk of the entrance aperture (¢ = 0.1, Ro@)/R. = 0.2). 


The normalized intensity distribution immediately follows as 


2 








(ae"* — 1) DY UPI Cr) + exp [id] 
k=1 


OSr,S MR (18) 








(ae’* — 1) a OPI @.) + exp [drs 
MR? <r; <= MR”, l=1,-++,n—1. 


In this expression I“ (r;) is the generalization of (15) which is given by 


Be z (B= ) (<r :) 
(k) 
IG) = | TAG tll arin: t) at. (19) 


Plots of (18) are shown in Figs. 11-13 fora = 1, a = 7/2, andz = 1.5. 
The assumed phase distributions in the entrance aperture are given at 
the top of the figures and the resulting intensity profiles are shown 
below. A qualitative reproduction of the phase distributions results in 
each case. 

The behavior of (18) in the limit of small phases, ¢, , explains some of 
the success that was achieved in the figures. From the definition of the 
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coefficients US" it follows that, if all the ¢, are small and, in particular, 
if ¢, = 0, 

es wa 1, es 4 al 
1 k= 1. 


Therefore, the jumps in intensity at the points 7; = MR’, to first order 
in the phase jumps, is just 


_AT(MRS) & 2 Re {(ae'* — 1)jivr — oI (MR”)} 
= 2asin a(d, — bro 1 (MRS). (20) 


Now, in the cases shown in the figures, a = 1, a = 7/2, and z = 1.5. 
Recalling that for zg = 1.5, 1 = 0.5 we conclude that 


AI(MR\) & by — be+1- 


Thus, if the ¢, are sufficiently small, and an appropriate value of z 
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Fig. 11—Response to phase rings with decreasing phase. 
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Fig. 12—Response to phase rings with increasing phase. 


is chosen, the jumps in intensity are of the order of the jumps in phase. 

Figures 11-13 also indicate that, for the values of the parameters 
chosen, the piecewise constant behavior of the phase distributions is 
essentially reproduced in the intensity profile. The success here is 
mainly due to the fact that not only was z chosen to yield a nearly 
uniform background, but also the z value selected was relatively small. 
Under these conditions the dependence of the integrals I‘) (7;) is 
suppressed,* and the effect of the phase object is to introduce essen- 
tially piecewise constant perturbations to a nearly uniform background. 

To demonstrate what effects can occur for a larger z value, we have 
plotted in Fig. 14 the response of a phase contrast system to a 
sequence of phase rings when a = 0.1, a = 7/2, and z = 21.21163.+ 
This is a practical case to consider since small values of a are fre- 


+ Consider equation (19) in the limit of small z. ; 
t This value of z corresponds to a phase plate spot size that includes about 
five and one-half Airy rings. 
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quently employed in phase constant microscopy to improve the con- 
trast between the phase object and the background. (For our purposes, 
large z values are needed to give a fairly uniform background over a 
large portion of the image of the entrance aperture when a. is small.) 
Unfortunately, the results that are observed are then quite poor. The 
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Fig. 183—Response to phase rings with alternating phase. 


problem here is that the integrals I™(r;) are now highly nonlinear 
functions of 7;, and, therefore, nonlinear perturbations of the order of 
the background are introduced.* 


4.4 Semi-circular Phase Disks 


The results of the last section demonstrate that with an appro- 
priate choice of the parameters a, a, and 2, circularly symmetric phase 


t Figure 5 demonstrates that for the values of the parameters chosen, the back- 
ground is of the order of a? = 0.01. However, from equation (20), we observe that 
the jumps in /, which we may take as indications of the magnitudes of the perturba- 
tions of the background, are of the same order when ¢; — ¢x41 is small. 
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Fig. 14—Response to phase rings (a = 0.1, 2 = 21.21163). 


distributions possessing discontinuities in the radial coordinate can be 
qualitatively reproduced in the intensity patterns. As a final example, 
we now examine the response of a phase contrast system to a simple 
angular discontinuity. 

Let the coordinates in the object plane be r, and 4, and consider the 
complex disturbance 


U.(r. , 9.) = eire (r./R,) exp [jO(4.)], (21) 
where 


= 0< 687 
Yo wr< 60, < 27 
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with yi and y2 assumed constant. With this phase object, the entrance 
aperture is divided into two semi-circular regions in each of which 
the phase is constant. 

In order to calculate the intensity pattern that is produced by (21), 
we must compute the convolution integral (see equation (10)): 


y J i(Qryr/R.)_ 
it; r 


As in our previous analysis, we will accomplish this task by first taking 
the Fourier transform of I, and then taking the inverse transform of 
the result. 

Now, it is a well-known result in the theory of Fourier transforms? 
that if a function g(r, 0) is separable in r and 6, ie, g(r, 0) = 
gr(r) g6(9), then the Fourier transform of g can be expressed by the 
following infinite series of Hankel transforms: 





I,=U,* (22) 


foe) 


Fig) = do e(—j)'e*Hilg-()] 


k=—00 


where 


is af" (ae**? da 
‘ Qn f) Jo 
and 
H,lg-@)] = 2m a rg?) I,(2urp) dr. 


In these expressions p and ¢ are the coordinates in the transform 
space, and J; is the Bessel function of the first kind of order k. 
Applying this result to (22) yields 


ioe} Ro 
F[I,] = cire (20. > c(—jre™ | rJ,(2arp) dr (23)' 
k=—-© 0 


where 


| 4(exp [j1] + exp [jv2]), k& = 0 
0 k even. 


(24) 


Cc, = 
1 : 
a (exp [jv2] — exp [7mi]), & odd 


t Here we have used the easily established result that 


r| Bsmt. | = circ (pR./y). 
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_ The appropriately normalized inverse transform of (23) is 


Qr 
2 [ [ F[I.]o exp [j2nF.p cos (8; — 4)] dp dé 


foe] 


y/Ro Ro 
= Ag” > Cy exp [7k 6; | / p / rJ ,(2rrp) drJ (277 ; p) dp. 
az : _ SO 0 

Here 6; is the angular coordinate in the image plane, and we have already 
introduced 7; = r;/M for later simplicity. Now, letting ¢ = 27R,p, 
s = r/R, , and substituting expressions (24) for the coefficients c, 
finally gives 


1, = 5(exp [ini] + exp Lira) hulr) + = (exp lire] — exp Lin) 


° D3 hi(r:) vn ies (25) 
k odd ¢ 


where 


Cae ie t [ sJ.,(s1) asJ.( Wk, f | dt. (26) 


Note that the integral h,(r;) is just the integral given in (11), and that 
the integrals h,(r;) are higher order generalizations of it. 

The intensity in the image plane is obtained from equations (10) 
and (25) as 


| | (ae'* — 1)1, + exp [fv] ?, OS 6, <0 
I(r; , 9) = ‘ 
| (ae’* — 1)I, + exp [jr] |’, <0; < 27 | (27) 


Clearly J depends upon both the radial and the angular coordinates in 
the image plane while the phase distribution over the entrance aperture 
depends only upon @. Therefore, one would not expect, in general, that 
the semicircles would be visible in the intensity pattern. We will now 
demonstrate, however, that if a small value of zg = 2zy is used for 
which the background is essentially uniform, and if y. — 7, is small, 
the semicircles can be reproduced, and the jump in the normalized 
intensity will be proportional to y2 — y: , at least to first order. More- 
over, as in our previous analysis, if a = 1, a = 7/2, and z = 1.5, the 
first condition will be satisfied, and the proportionality factor will be 
approximately unity. 

To observe these facts we need only to substitute the first term of the 
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power series for the Bessel functions in (26) and conclude that for z 
sufficiently small 


2h+2 
z 


r,_\' 
hr) © (2) (2*k!)*(k + 2)(2k + 2) 


Thus for small: z, the coefficients h,(7,) are small, and they decrease 
extremely rapidly as k increases. Hence, if y2 — 7 is sufficiently small, 
the first term in (25) will dominate the second and I, can be approxi- 
mated by 





I, © 3(exp [ji] + exp [jve])h. (71). 
With this approximation the normalized intensity is approximately 


| 2(ae’* — 1)(exp [ii] + exp [ive])ho(rs) + exp [ival |’, 


06; Sr 


~ 


L[(r; } 6;) ~ 0 = vr; = MR, . 
| $(ae'* — 1)(exp [i] + exp [fv2)h(r.) + exp [ini] |’, 
Tv < 0; < 2 TT. 


Now, if the background is essentially uniform, the integral h,(r;) in 
the above expression is essentially constant, and we observe that, for all 
practical purposes, J is constant in each of the semicircular regions 
0< 6;< 7,7 < 0; < 2. Furthermore, the jump in J between these 
two regions is just 


AT = 2 Re {¥(ae'* — 1)(exp [— ira] — exp [— fri] (exp Lins] 
+ exp [jr2])h.(7:)} 
= 2asina h,(r;) sin (v2 — 1) 
YY 2asinah,(r;)\Y2 — V1)- 
From our previous work, we may recall that, for 2 = 1.5, ho(7;) = 0.5, 
and thus, fora = 1,a = 7/2, and z= 1.5, 
AT © (2 — 11): 


The same conditions that permitted radial discontinuities in the phase 
distribution to be qualitatively observed in the intensity pattern 
appear to permit a simple angular discontinuity such as (21) also to 
be observed. The representative results of numerical computations of 
the exact expressions (25), (26), (27) shown in Fig. 15 bear this out. 


1966 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


0.6 


0.5 


H 
N 
> 0.4 
Il 
HI 
Y= fe) 
0.3 Yo= 0.1 
ry/MRo =0.5 
0.2 





Fig. 15—Response to a semi-circular phase disk. 


V. CONCLUSIONS 


A general expression for the intensity distribution that is produced 
by a phase contrast imaging system with a circular phase plate was 
derived. It was shown that the results reduced to the well known 
expression for the phase contrast microscope when the size of the phase 
object is small compared to the area of the entrance aperture of the 
system. To obtain the intensity distribution for larger phase objects, 
a convolution integral must be evaluated by analytical and/or nu- 
merical techniques. 

The intensity patterns that are produced by phase disks, phase 
rings, and semicircular phase distributions were derived, and the re- 
sults of numerical computations that were based on these derivations 
were studied. In general, it appeared that qualitative reproductions 
of these simple phase distributions could be observed in the intensity 
patterns if the size of the phase spot on the phase plate was chosen 
to yield a uniform background, if the resulting parameter, z, was 
small, and if the magnitudes of the phase perturbations were small 
enough so that the approximation e* = 1 + jd held. Moreover, it 
appeared that, if these conditions were not met, rather poor reproduc- 
tions of even these simple phase distributions could result. 
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A New Equalizer Structure for Fast 
Start-Up Digital Communication 


By ROBERT W. CHANG 
(Manuscript received November 18, 1970) 


The use of transversal filters for automatic equalization has made 
possible high-speed data communication over voiceband telephone 
channels. Recently much attention has been focused on the possible 
use of the high-speed data sets in private line multiparty polling 
systems. However, for such applications, rt is necessary to reduce 
the start-up time of the present automatic equalizer drastically. This 
paper examines the start-up time (settling time) of the transversal 
filter equalizer for two important classes of data communication 
systems: Class IV partial-response systems and _ single-sideband 
Nyquist systems. (The latter represents the limiting case of vestigral- 
sideband systems with small roll-off bandwidth.) It is shown that in 
single-sideband Nyquist systems the input signals to the gain controls 
of the transversal equalizer may be nearly orthonormal. Consequently 
the equalizers may have a short settling tume. It 1s also shown that the 
equalizer settling tume 1s much longer in Class IV partial-response 
systems, because such systems use controlled intersymbol interference 
and the input signals to the gain controls are highly correlated. 

The possibility of reducing the settling time of the automatic 
equalrzers is examined. A new equalizer structure 1s developed based 
on the following principles: (i) Equalizer setthng time can be mini- 
mized by making the input signals to the gain controls orthonormal, 
and (2) Such a minimization does not change the noise power, the 
mean-square equalization error, the convexity of the gain control 
adjustment, and the feedback control loops in the equalizer. These 
principles are general in that they apply regardless of the type of 
modulation—single- , vestigial- , or double-sideband (SSB, VSB, or 
DSB)—or the signaling scheme (Nyquist or partial-response). Appli- 
cation of these principles to Class IV partial-response systems is con- 
sidered. For private line systems and systems where amplitude 
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distortions in the communication channels are not severe (delay 
distortions can be arbitrary), the new equalizer can be implemented by 
simply adding a prefixed weighting matrix to the conventional trans- 
versal equalizer. Analysis and computer simulation show that the 
use of such a new equalizer can result in a significant reduction in the 
system’s start-up time. 


I. INTRODUCTION 


In order to meet the needs of: the rapidly growing computer and 
data processing industries, a number of high-speed data sets have been 
developed in recent years for voiceband telephone channels. Most of 
these data sets use transversal filterst for precise automatic equaliza- 
tion. The transversal equalizer consists of a tapped delay line with 
variable tap gains. During a start-up period prior to data transmis- 
sion, the tap gains are adjusted automatically to minimize the peak 
distortion? or the mean-square error?*+ of the received pulses. The 
time required to adjust the tap gains to nearly their optimum settings 
is usually called the settling time of the equalizer. Most automatic 
equalizers have settling times of a few seconds. The start-up time of 
the system can be longer because there are usually other operations to 
be performed in the start-up period (operations such as synchroniza- 
tion, carrier recovery, and so forth). 

Recently, much attention has been focused on the possible use of 
the high-speed data sets in private-line multiparty polling systems 
(such as airline reservation systems, on-line banking systems, and so 
forth). Such systems are generally real-time information retrieval 
systems where the inquiry and response are short (the message lengths 
are usually less than 1000 bits®). With a 4800 b/s data set, it takes 
only 0.208 second to transmit a message of 1000 bits. Consequently 
the actual transmission time can be much less than the start-up time 
of the system (which can be five seconds). In order to allow addi- 
tional stations to be served or to reduce the response time of the 
system (response time is important in real-time systems), it is 
necessary® to reduce the start-up time of the high-speed data sets 
drastically. This means that the settling time of the automatic 
equalizer must be reduced drastically (for example, from a few seconds 
to tens of milliseconds). 

Such a drastic reduction in the start-up time of the high-speed data 
sets raises a number of theoretical questions. For example, it is no 
longer sufficient to just prove the convergence of the equalizer adjust- 
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ment; it must be shown that the convergence is sufficiently fast to 
meet the requirement. It becomes necessary to examine the dependence 
of equalizer settling time on modulation and signaling schemes. Instead 
of considering synchronization, carrier recovery, and automatic equali- 
zation separately, one has to consider the mutual dependence of these 
adjustments and the possibility of minimizing the overall adjustment 
time. The start-up time requirement also provides a strong motivation 
to search for a new equalizer that has a settling time shorter than 
that of the conventional transversal equalizer. Answers to some of 
these problems are presented in this paper. Because the paper is 
lengthy, the contents of each section are outlined here; the results 
are summarized in Section VIII (the reader may read Section VIII 
first). 

Section II includes a description of the mathematical model and 
reviews some of the fundamental works on automatic equalization 
(particularly those by Lucky and Gersho). Section III examines 
the equalizer settling time for two important classes of digital com- 
munication systems: the Class IV partial-response system and the SSB 
Nyquist system. (The latter represents the limiting case of VSB 
systems with small roll-off bandwidth.) Surprisingly, the results show 
that the equalizer settling times of these two systems can be very 
different. The reason for this difference (eigenvalue spread) is ex- 
plained so that the method of analysis can be extended to other sys- 
tems. Sections IV and V consider a general data communication 
system and develop a new equalizer structure for fast start-up pur- 
pose. These sections stress the underlying principle (condition of 
orthogonality) and analyze the various properties of the new equalizer 
(including convergence rate of equalizer adjustment, residual noise 
power, minimum mean-square error, and convexity of the adjustment). 
In Sections VI and VII, application of the new equalizer to the Class IV 
partial-response system is considered. Most importantly, it is shown 
that the new equalizer can be implemented by simply adding a pre- 
fixed weighting matrix to a conventional transversal equalizer. The 
related analytical studies and computer simulation are described; 
fast convergence of the equalizer adjustments is demonstrated. Sec- 
tion VIII is a summary of the results. 


II. REVIEW OF FUNDAMENTALS 


An amplitude modulation data communication system utilizing a 
transversal equalizer is depicted in Fig. 1. The equalizer consists of 
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Fig. 1—Block diagram of an amplitude modulation data communication sys- 
tem with a conventional transversal equalizer. 


a delay line tapped at J-second intervals, where 7’ is the signaling 
interval of the system. The 7th tap, 7 = 1 to N, is connected through a 
variable gain control c, to a summing bus. During data transmission, 
the transmitter transmits the information digits sequentially at time- 


instants ¢ = --- ,t; — T,t,,t, + T, t, + 27, +--+ . The equalizer 
output is sampled sequentially at time-instants t = --: , & — T, 
to, te + T, te + 2T, +++ , and the time-samples are used to recover 


the information digits. To simplify the notations, we shall shift the 
origin of the time-axis to make t. = 0. 

When an impulse 6(¢ — ¢,) is applied at the transmitter input, the 
equalizer input and output are, respectively, x(t) and y(t). Since we 
consider only linear systems, y(t) is the overall impulse response. The 
desired overall impulse response of the system is d(t). In this study, 
we adopt the familiar mean-square error criterion®**® and adjust the 
gain controls of the equalizer to minimize the mean-square error 
between y(t) and d(t) at the receiver sampling instants t = --- 
—T, 0, T, 27, --- . The mean-square error can be written as 


? 


oe] 


‘= ps [y@T) — d@T)/. (1) 


=-0 


It can be seen from Fig. 1 that 


yi) = Seale — & — YT. (2) 


For the sake of simplicity, we shall use the abbreviations y; = y (tT), 
i = d(T), and x; = x(iT). It can be seen from (2) that (1) can be 
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written in the following matrix form 


e=c/Ac—2cv+ >> di, 


i=-0 
where 
Cy 
C2 
c= ; 
Cy. 
Qi. Ayn2 8 tt AN 
Rese Q21 Qe2 *** Gen 
Qni1 Gn2 °*°* Gyn. 
foe) 
ai = 2s LiniaiHi-j+1, All 4,9; 
l=-0 
Vy 
v 
leks ie 
ON. 


v0, = a2 t:-p410; ; all k. 


i=—00 


It can be shown that A is positive definite. 


1973 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


Let de/dc; be the partial derivative of « with respect to c; , 1= 1 to 
N, and let de/dc represent an N Xx 1 column vector whose zth element 


is de/OC;; 1.€., 


oe 


0c, 
de _ SE], 
dc +s, 

dé 


dew 
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From (1), we obtain 


< = 2Ac — Qv. (9) 
The measured de/dc in actual data sets will deviate* from that in (9) 
due to noise in the receiver. However this deviation is negligible in 
high signal-to-noise ratio systems. For example, it has been found 
' (Section 7.3) that at a 30-dB signal-to-noise ratio this deviation has 
only a minor effect on the settling time of the equalizer. In this paper 
we consider high signal-to-noise ratio systems and neglect such a 
deviation. 

The optimum value of c that minimizes the mean-square error e will 
be called copt. It is clear that c minimizes « if and only if de/dc = 0; 
therefore, from (10) the optimum c is 


Cot = A’‘V. (10) 

The difference between c and Copt is denoted by e: i.e., 
e=c—A’’y. (11) 
Let «min be the minimum value of « when ¢ = cy. From (3) and (10) 
Emin = yy d; — v’A7'v. (12) 


Now consider the adjustment of the equalizer. As is well known,?? 
the equalizer can be adjusted in the training period prior to data 
transmission by transmitting either a succession of isolated test pulses 
or a sequence of Pseudo-random numbers. Since’ these two methods 
differ considerably, we shall consider only the first method (isolated 
test pulses) in this paper. 

In the training period, isolated impulses are applied to the trans- 
mitter input. For instance, 6(t — ¢,) in Fig. 1 may be the kth such 
impulse. The transmission of §(¢ — ti) produces the test pulse x(t) at 
the equalizer input. From x(t) the partial derivatives de/dc;, 27 = 1 
to N, are computed*** and the gain control c; is changed by an amount 
proportional to de/dc;. This process is then repeated for the next 
test pulse. 

The adjustment made after the kth test pulse, k = 1, 2,3, -+- , will 
be referred to as the kth adjustment. The initial values of c, e, and « 
(i.e., their values prior to the first adjustment) will be denoted, 
respectively, by cy, @9, and «,. The values of c, e, and « after the 
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kth adjustment are denoted, respectively, by c,, e,, and «,. From 
(11) 


e,=c,—-A‘’v k=0,1,2::-. (13) 
It can be easily shown from (3), (12), and (13) that 
€. = €min t efAe,, &k=0,1,2,---. (14) 
The kth adjustment is made according to the equation 
1 0 
Cy, = Chey — 4 oy| 2] ) (15) 


where a,, k = 1, 2,3, +++ , are suitably chosen constants.* 
The subscript k of d«/dc indicates that the de/dc is computed from 
the kth test pulse. It can be shown from (9), (15), and (13) that 


e, = (I— mA)e:1, &=1,2,3,-°-. (16) 


We now proceed to study the convergence of the mean-square error 
for several data communication systems. 


III. A STUDY OF CONVERGENCE FOR PARTIAL-RESPONSE AND NYQUIST 
SYSTEMS 


We have defined e, as the value of ¢« after the kth adjustment. Note 
from (14) that ¢, consists of two terms. The first term éna;, is the ir- 
reducible value of «. Only the second term e/Ae, depends on c and the 
adjustments. Thus, we shall study the convergence of the second 
term in this section. 

It can be seen from (6) that A is a symmetric matrix. Let the eigen- 
values of A be denoted, in the order of increasing magnitude, by ); , 
t = 1 to N, so that 


Ar Se St: Say, (17) 


and let u;,2 = 1 to N, be a set of orthonormal eigenvectors of A (u; is 
the eigenvector corresponding to ;). It is well known that A can be 
represented in the form 


A = QDQ’, (18) 
where D is an N X N diagonal matrix whose ith diagonal element is 
d,, and Q is an N X N matrix whose ith column is the eigenvector u, . 
It is also well known that Q is an orthogonal matrix; Le., 


QV = Q*. (19) 
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From (18) and (19), 


I — aA = QUI — a,D]Q’. (20) 
By repeated application of (16), one obtains 
ee; = (I = a,A)(I — a,-1A) ee (I = a,A)(I rae a, A)e, 
k 


I] d-aA)e., k=1,2,3,-°-. (21) 


n=1 


Substituting (20) into (21) and noting that Q’Q = I gives 


é= | I as cD) |Q’en k=1,2,3,---. (22) 
From (22) and (18), 


efAe, = 1 I ae a.) |D| IT ccm aD) |O'e, ; 


k=1,2,38,+:-. (28) 


Using the properties of D and Q described one can carry out the 
matrix multiplications in (23) and obtain 


ejAe, = > 8h), k= 1, 2,3, +++. (24) 
where 
E(k) = (eiu*ns| TT (1 — ad) | ; r=1 to N. (25) 
In a similar manner, we find that 
ethe, = 80), (26) 
where 
£:(0) = (efu,)"r; , t=1toN. (27) 


Since A is positive definite, \, > 0 for all 7. Thus, &;(4) = 0 for all 7. 
Consequently, ejAe, converges to zero if and only if £,(k), 7 = 1 to N, 
all converge to zero (see (24)). For this reason, £;(k) will be called the 
ith error component. If the error components all converge rapidly 
to zero as k increases, ef Ae, converges rapidly to zero. 

It is clear from (25) that £;(k) converges to zero if and only if the 


factor 
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| TL a — aay | 


converges to zero. Thus, the convergence of £,(k) to éy(k) depend 
only on two sets of parameters: a;, °** , a, and Ay, °°: , Ay. The first 
set of parameters corresponds to the magnitudes of the gain-control 
adjustments [As can be seen from (15), a, determines the magnitude 
of the kth adjustment.]. In the following subsections, we show that the 
second set of parameters, A1, *** , Ay, depend on the modulation 
scheme and channel characteristics. In some systems, A1, °°: , Ay 
differ only slightly in value. Consequently a, can be selected such 
that each adjustment reduces each of the error components by a large 
factor (such as 100). However, this is not possible in some other 
systems. , 


3.1 Class IV Partial-Response System 

There are several classes of partial-response systems.’ We shall 
consider the most important one: Class IV partial response sys- 
tem. 10111213 The results can be easily extended to other classes. 

As depicted in Fig. 1, the transfer functions of the transmitting 
filter, transmission medium, and receiving filter are, respectively, F(f), 
F.(f), and F’3(f). The amplitude and phase characteristics of F;,(f) 
will be denoted, respectively, by |Fi(f)| and B;(f), ie., 


FQl=|FiG le", i= 1,2,3. (28) 


Note that in this paper J will be used to denote the imaginary number 
VV —1 (7 is used as an index). 

In a Class IV partial-response system, the transmitting and the 
receiving filters are band-limited; that is, 


| Fi)Fs(f) | = 0, 


when 


lfl Sh and |f{ 2h, (29) 


where f; and fe are, respectively, the lower and the upper cutoff 
frequencies. The demodulating carrier frequency, f, , is usually equal to 
fe. This implies that the system is SSB. According to the previous 
definition, when an impulse 8(¢ — ¢,) is applied at the transmitter 
input, the equalizer input is x(t). Let X(f) denote the Fourier trans- 
form of x(t). It can be shown from (28), (29), and the demodulation 
process that 
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X(f) 


3 Ff = fF .(f _ fF's(f — f.)| 
-exp {J[6i(f — f.) + Bo(f — fe) + Ba(f — f-) — 2a(f — ft + 41}, 
OSfSh—h 


2G + f)Fof + f.)Pa(f + f.)| 

‘exp (J[Bi(f + f.) + Bof + fo) + Bs(f +f.) — 2r(f + fat — 4)}, 
—-G-—f) S780 

= 0, all other frequencies. (30) 


Now we can determine the A matrix. From (6), the elements of A 
are 


a; = 2 U1-i41%1-ja1 + (6) 
In a Class IV partial-response system the signaling interval is 
ee (31) 
2(fe ie hi) 
It is clear from (80) that x(¢) is band-limited from 0 to (fe — f1) Hz. 
Thus, the time-samples x(kT), k = --: , 0, 1, 2, --- , are taken at 
the Nyquist rate. Therefore, according to the sampling theorem 
fo ste — or + Pll — + Plt = Ae OD teeter. 
—2 (fe a fi) =o (32) 


Comparing (6) and (32) shows that 


eee ee ‘s alt — iT + Tlalt -—j0 + T] dt. (33) 


From Parseval’s theorem and (80), one can rewrite (33) as 


ais = —) [ ” [eos afi — HP] 


WAG — f)Pof — f)Fs(f — f)\P df. (84) 
It can be seen from (34) that aj; is independent of the following 
parameters: demodulating carrier phase @, system timing ¢,, phase 
characteristics of the transmitting and receiving filters, and phase 
characteristic of the transmission medium. To evaluate a; , we need to 
specify only the amplitude characteristics |Fi(f — fo)|, |Fo(f — fe) |, 


and |Fs(f — f-)|. In a Class IV partial-response system, the trans- 
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mitting and receiving filters are designed such that 





IA 


rl rato =sine[ Le], pst st, 





= sing| =A], -fSfS—h. (8) 


Substituting (35) into (34) gives 
feats 
a= (2-4) f (cos 2effi - AT 
eee ae _ a 
J (sin = 7 = -) [Fo(f 101 df. (36) 


We first consider the case where the transmission medium has a 
constant amplitude characteristic 


lFo(f| = 1 (37) 


in the pass band f; S f S fo. Substituting (37) into (86) and evaluat- 
ing the integral, we obtain 


2 
a,, = 2h) i—j=-2 or 2 
—_ 2 
(fo =H : t—-j=0 
= 0, all other 7 — 7. (38) 


The constant term (f2 — f,)’/2 in a;; may be dropped (this corresponds 
to introducing a gain of V2/(f. — f:) in the channel). Then (38) becomes 


a;; = —3, t—j=-2 or 2 
=<]. ~—j=0 
= 0, all other 2 — j. (39) 


Thus, the matrix A is of the following form 


1 0-2 0 
a. 2 ee 
1 
0 


ble 


| 
vl 
(om) — 


(40) 


a) 
| 
Nie 
rH oO 
Ter oS 





| 0 0 OO O-- 1 
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In particular the main diagonal elements are 1, the second off- 
diagonal elements are —1/2, and the other elements are all 0 (notice 
that A belongs to a class of matrixes known as the ‘‘Toeplitz’!*). We 
have defined A; to Ay as the eigenvalues of A, and uw, to uy as a set 
of orthonormal] eigenvectors of A. These eigenvalues and eigenvectors 
are determined in Appendix A. Since transversal equalizers usually 
have an odd number of taps, we shall assume that N is odd in the 
following discussion. From Appendix A, the N eigenvalues of A are 











- kr = N+1 
1 cosy? a aes ener Ga (41) 
2 
and 
iS eg peta Se, (42) 
N—1 2 
2 as 


Thus, the eigenvalues of A are points on the curve 1 — cos 6,0 < @ 
< a (see Fig. 2). The minimum eigenvalue 


1—- eo 
a ae 
e 0, 
Xd 
(72) 
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and the maximum eigenvalue 


N+1 





5 Tv 
L665: 
N+1 

eee +1 

are also shown in Fig. 2. Other eigenvalues lie between these two 
points. It can be seen from this figure that, for typical values of N 
(13, 17, etc.), the minimum eigenvalue is very close to 0 and the 
maximum eigenvalue is very close to 2. For example, consider a trans- 
versal equalizer with 17 taps (NV = 17). From (41) and (42), the 17 
eigenvalues of A are 0.0489, 0.06038, 0.1910, 0.2840, 0.4122, 0.5, 0.6910, 
0.8263, 1, 1.174, 1.809, 1.5, 1.588, 1.766, 1.809, 1.940, and 1.951. . 

When the amplitude characteristic of the transmission medium is 
not exactly a constant in the passband f, S f S fe, the calculations 
from (38) to (42) will change slightly. Consequently the eigenvalues 
will differ slightly from those above. 

‘Having determined the distribution of the eigenvalues, we now 
consider the convergence of the error components é,(k), 1 = 1 to N. 
Prior to the kth adjustment, the 2th error component is 


E(k — 1) = (es) (ks ad)? (43) 


After the kth adjustment, the ith error component is 


k 
£,(k) = (esu*n,] IT (1 a ad) | 
n=1 
Substituting (48) into the above equation gives 
£,(k) = (1 — Odi) E(k = 1), k = 1, 2, 3, eg (44) 


It is easily seen from (44) that the kth adjustment will reduce each 
of the error components by a large factor if 


(1 — a,d;)? <1, for all z 
or if 


ad 


; forall 7. (45) 


a l= 


Ak 


Unfortunately, a, cannot satisfy (45) because A1, *** , Ay are very 
different in value. Therefore, the kth adjustment cannot reduce each 
of the error-components by a large factor. To illustrate this quantita- 
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tively, consider again the 17-tap transversal equalizer (A; = 0.0489 
and \y7 = 1.951). In order to reduce the first error-component &(k), 
we set (1 — azd:)? K 1 or a, = 1/dA, = 20.4. However, this a, will 
cause a large increase in some other error-components. For example, 
the 17th error-component will increase about 1505 times because 
(1 — azdi1z7)? = (1 — 20.4 x 1.951)? = 1505. On the other hand, if we 
wish to reduce the 17th error-component, we should set (1 — axd17z)? 
“ 1, or a, = 1/diz7 = 0.512. However, this ox is too small to reduce 
some other error-components rapidly. For instance, the first error- 
component will reduce only five percent because (1 — axzdAi1)? = 
(1 — 0.512 x 0.0489)? = 0.95. These examples clearly demonstrate 
that the kth adjustment cannot reduce each of the error-components 
by a large factor. 
We now summarize the results in this subsection: 


(t) In a Class IV partial-response system, the eigenvalues \, to 
dw of the A matrix depend only on the amplitude characteristic 
of the channel, but not on carrier phase, system timing, and 
phase characteristic of the channel. 

(iz) These eigenvalues are very different in value. For typical values 
of N (13, 17, etc.), the minimum eigenvalue \, is very close 
to 0, while the maximum eigenvalue \y is very close to 2. 

(a7) Because of the large differences in the eigenvalues, each adjust- 
ment of the gain controls cannot reduce each of the error- 
components by a large factor. 


3.2 SSB Nyquist Systems 


In this section, we study SSB data communication systems which 
transmit at the Nyquist rate with sin x/x pulses (hereafter referred 
to as SSB Nyquist systems). Such a system has the same configuration 
as the Class IV partial-response system except that its transmitting 
and receiving filters have constant amplitude characteristics in the 
passband. The main advantage of an SSB Nyquist system is that it 
transmits at the maximum possible bauds (Nyquist rate) without noise 
penalty. Saltzberg*® has shown that this signaling method is not as 
sensitive to timing error as is commonly believed. In fact we show 
in this section that this scheme exhibits fast start-up advantages. 

The transmitting and receiving filters are specified by 


Fu) 


1, fi = lf = fe 


= 0 other f. (46) 


Vy) Van 
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and 


Ff) 


i; fi Ss If| = fo 
=O; other f. (47) 


The signaling interval T is again given by (31), and a; by (84). Sub- 
stituting (46) and (47) into (84), we have 


aus = (hh — fy [ "" [eos 2afli — NT] (Ff — fe af. (43) 


Note again that a;; (and hence the eigenvalues A; to Ay) depends only 
on the amplitude characteristics of the transmission medium and the 
filters. When the transmission medium has a constant amplitude 
characteristic 


| Fo(f) | =d 
in the passband f; =f S fe, (48) yields 
ai; = (fe — fi)’, ~—j=0 
= 0, ~—7 240. (49) 


Neglecting the constant (f2 — f1)? above, we see that A is simply the 
identifying matrix and the eigenvalues of A are 


A; = I, t= 1toN. (50) 


Now consider the convergence of the error components é;(k), 7 = 1 to 
N. Prior to the first adjustment, the 7th error component is 


£,(0) = (e/u,;)*A; . (51) 
After the first adjustment, the 2th error component is | 
E.(1) = (egus)°A,(1 — ad,)’. (52) 
Substituting (50) and (51) into (52) yields 
£,(1) = (1 — a1)"é,(0). (53) 


If we set a, = 1, &,(1) = 0 for all z. In other words, when a; = 1, the 
first adjustment reduces all the error-components to zero. Conse- 
quently the mean-square error e is reduced to its irreducible value 
€min after only one adjustment. This fastest possible convergence is 
obtained regardless of the initial equalizer settings, the carrier phase 
and system timing, the phase characteristic of the transmission 
medium, and the phase characteristics of the transmitting and receiv- 
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ing filters (because the eigenvalues \; to Ay are independent of all 
these parameters). 

From (48), aj can be computed for any given |F2(f)|. When 
|F2(f)| is not exactly a constant in the passband, A will not be an 
identity matrix and the eigenvalues will not be all equal. However, 
fast convergence is obtainable as long as the differences between the 
eigenvalues are small. For example, consider the case where the 
minimum eigenvalue is 0.9 and the maximum eigenvalue is 1.1. If we 
use a, = 1, the maximum value of (1 — a,A;)? will be (1 —0.9)? = 0.01. 
Consequently the first adjustment will reduce each of the error com- 
ponents by at least a factor of 100 (as will each following adjustment). 
The equalizer adjustment therefore will be completed after only a 
few adjustments. 


IV. A NEW EQUALIZER STRUCTURE 


In the preceding sections, we have clearly shown that, when the 
eigenvalues of A have close magnitudes, each adjustment of the 
equalizer reduces each of the error-components by a large factor (such 
as 100). We have also shown that such a fast convergence is not pos- 
sible when the eigenvalues are very different in magnitude (such as in 
the case of a Class IV partial-response system). These results suggest 
that in order to improve the convergence rate we should attempt to 
reduce the differences between the eigenvalues. 

Now we ask: What causes the eigenvalues to be different? Is tt 
possible to reduce such differences (and hence increase the con- 
vergence rate) by changing the equalizer structure? Does the use of 
the new equalizer structure alter the system’s performance otherwise? 
We shall consider the first two questions in this section, and derive a 
new equalizer structure. The last question will be considered in the 
next section. Application of the results to a Class IV partial-response 
system and the various related problems will be studied in Sections VI 
and VII. 

It may appear that the new equalizer structure derived in this 
section requires complicated mathematical operations (computation 
of eigenvalues and eigenvectors of a matrix). However, it will be 
shown in Section VI that such mathematical operations can be com- 
pletely eliminated for the systems of interest here. 

Now consider the first question: What causes the eigenvalues to 
be different? From the definition in (6) 
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eo 


as = Do all i+ YT eld — 7+ YT. 

Since the input to the ith gain control is x[t — ((-—1)T],2[(l-—24+ 
1)T] are time-samples of the input to the ith gain control, while 
x[(l — 7+ 1)T] are time-samples of the input to the jth gain control. 
Therefore, a;; 1s simply the cross-correlation between the inputs to the 
ith and the jth gain control, and A has the interpretation of a corre- 
lation matrix. Therefore, we might state that it is the correlations 
between the inputs to the gain controls that result in the differences 
between the eigenvalues. When the inputs to the gain controls are 
orthonormal, A is an identity matrix and the eigenvalues are equal. 
When the inputs to the gain controls are correlated, the eigenvalues 
are different. 

The above discussion suggest that, if we could construct a new 
equalizer such that the inputs to the gain controls are orthonormal, 
the eigenvalues of the correlation matrix of the new equalizer might 
be all equal and it might be possible to minimize the mean-square 
error in a single adjustment. Following this line of thinking, we obtain 
the generalized equalizer structure depicted in Fig. 3. As shown in 
Fig. 3, the equalizer input x(t) is connected to a bank of filters. The 
output of the ith filter is connected through a variable gain control c; 
to the summing bus. The equalizer output is 


y(t) = x cz.(t), (54) 


where z,(t) is the input to the zth gain control c;. These signals z;(¢), 
t= 1 to N, are orthonormal when 


ie) 


dD, 2lTz(IT) = 85; « (55) 
l=-0 
In the following, we show that, for any given channel, the filters in 
Fig. 3 can be designed to satisfy (55). We also show that, when (55) 
is satisfied, the mean-square error can be minimized by a single 
adjustment of the gain controls. 

Because of the press toward digitalization and integrated circuits, 
we shall specify the filters in Fig. 3 directly in digital filter form 
(instead of specifying them in analog filter form and then approxi- 
mating them with digital filters). We shall use nonrecursive digital 
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FILTER 


T(t) ig ee ee 


FILTER FILTER 






Fig. 3—Generalized equalizer structure. 


filterst® because they do not have stability problems and can be 
implemented most easily. 

A realization of the generalized equalizer using nonrecursive digital 
filters is depicted in Fig. 4. The N digital filters share the same tapped 
delay line. The ith digital filter, 2 = 1 to N, consists of the tapped 
delay line and the N coefficients P;; to Piy . Its output is 


z(t) = > Peal — (4 — 1)T], 7~=1 to N. (56) 


The gain controls c; to Cy are again adjusted to minimize the mean- 
square error ¢ defined in (1). From (54) and (56), we can rewrite (1) 
into the following matrix form 


e = C/PAP’c — 2c/Pv+ >> d; (57) 


k=—-@ 





y(t) 


Fig. 4—A new equalizer structure (a realization of the generalized equalizer 
structure using nonrccursive digital filters). 
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where c, A, and v have been defined previously [see (4), (5), and (7) ]. 
The matrix P is defined by 


Py, Pr. Sore Pry 
p= Po Po. ytate Poy |. (58) 
Py Pyo Sar Pyy. 


As in Section II, we define de/dc as the N X 1 column vector whose 
ith element is de/dc; . It can be shown from (57) that 


oe = 2PAP’c — 2Py. (59) 


It can also be shown from (57) that c minimizes ¢« if and only if 
de/dc = 0; therefore, from (59) the optimum value of c¢, Copt , is 
Cope = (PY) Av. (60) 
We again define e as the difference between c and Copt ; hence, from 
(60) 
e =c — (P’) ‘A’. (61) 
As in Section II, the initial values of c, e, and e are denoted, respec- 


tively, by co, €9 , and e). The values of c, e, and « after the kth adjust- 
ment are denoted, respectively, by c, , e, , and «,. From (61) 


e, = c, — (P’) ‘Ay, k =0,1,2,---. (62) 
It can be shown from (57) and (62) that 
¢ = >, d—wA'"v+e{PAP’e, k=0,1,2,---. (63) 
k=—0 
The kth adjustment, k = 1, 2, 3, --- , of the gain control is again 


made according to (15). From (62), (15), and (59), 


rod 


1 a) sade 
@, = Cr-1 — 2 | 2] — (P’) "A ‘Vv 
= ¢,_, — (P’)'A'v — a,[PAP’c,_, — Pv]. (64) 
From (62), we have 
Ox-1 = Cy — (P’) “Av. (65) 


Using (65), we can rewrite (64) as — 
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ee, = @,-1 — a,[PAP’c,_1 a Pv] 

@,-1 — a,PAP’[c,-1 — (P’) Av] 

[I = a,PAP’ Je,-, ) 


I 


k=1,2,---. (66) 
Now we can prove the first statement after (55) |that is, for any 


given channel, P can be chosen to satisfy (55)]. It is easily shown 
from (58), (5), (6), and (56) that PAP’ can be written in the form 


bi Bie we bin 
PAP’ = a bs aa a \ (67) 
byi bye e <5 byw. 
where 
bi; = » 2,(IT)z,(IT). (68) 
l=—00 
Equation (55) is therefore equivalent to bj; = $i , or 
PAP’ = I. (69) 
From (18), 
A = QDQ’. (18) 


Let H be an N X N diagonal matrix whose 7th diagonal element is 


/ x, , that is, 


Vx, 0 aoe: 0 
H = ‘ Vis _ : (70) 
0 0 Seam HAD Ne 
Then D = HH’ and (18) becomes 
A = QHH’Q’. (71) 
Substituting (71) into (69) yields 
PQH(PQH)’ = I. (72) 


Equation (72) holds if and only if PQH is an orthogonal matrix G. 
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From Q’Q = I, we can rewrite 


PQH =G (73) 
as 


P = GH"'Q’. (74) 


For any given channel |[i.e., for any given x(t)], we can compute aj; 
from (6) and determine the eigenvalues A; to Ay and the eigenvectors 
u, to uy. This determines D, Q, and H. There are an infinite number 
of N x N orthogonal matrices. Any of them can be used as G (the 
choice of G will be discussed later). Then from (74) P can be deter- 
mined, and this P satisfies (72), (69), and (55). This proves that, for 
any given channel, the digital filters in Fig. 4 can be designed to 
satisfy (55). 

Next we prove the second statement after (55). As in Section II, 
We USE €min to denote the minimum value of « when c = Cop . Substitut- 
ING Copt In (60) into (57) yields 


Emin = >, di — WAY. (75) 
Combining (63) and (75) gives 
€. = €min + e{PAP’e, k =0,1,2,---. (76) 
From (76), the initial value of ¢ is 
€) = €min + @j/PAP’e, , (77) 
and the value of ¢« after the first adjustment is 
€: = €min + e{PAP’e, . (78) 
From (66), e; = [I — a,PAP’]eo . Substituting this into (78), we have 
€: = Emin + C4[I — a,PAP’JPAP’[I — a,PAP’e, . (79) 


when P satisfies (74), (69) holds; i... PAP’ = I. If we set a; = 1, 
I — ajPAP’ = 0. Then from (79) « = enn. This proves that, when P 
satisfies (74) and a, = 1, the mean-square error « reduces to its 
minimum value enin after only one adjustment of the gain controls. 
In general it can be seen from (66) that, when P satisfies (74) and 
a, = 1, e, = [I — o,PAP’Jex_1 ‘= 0, regardless of the value of 
e,—1. Consequently « = min, regardless of the value of «1. This 
means that each of the adjustments is capable of reducing the mean- 
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square error to its minimum value emin , regardless of what is the value 
of « prior to that adjustment. 

Now we summarize this section. It is emphasized at the beginning of 
this section that it is the correlations between the inputs to the gain 
controls that determine the differences between the eigenvalues and 
consequently the rate of convergence of «. When the inputs to the 
gain controls are orthonormal, « can be minimized in one adjustment. 
The inputs to the gain controls can be made orthonormal by using the 
generalized equalizer structure depicted in Fig. 3. The generalized 
equalizer can be realized with digital or analog filters. A realization 
using nonrecursive digital filters is depicted in Fig. 4, and is analyzed 
in detail. It is shown that, for any given channel, one can design the 
digital filters from the simple equation (74) to orthonormalize the 
inputs to the gain controls. When the digital filters are so designed, 
each adjustment of the gain controls can reduce « to its minimum value, 
regardless of the value of « prior to that adjustment. Therefore, « 
can be minimized by only one adjustment of the gain controls. 


V. FURTHER PROPERTIES OF THE NEW EQUALIZER 


We have seen in the preceding section that it is possible to minimize 
e in one adjustment. In this section we show that such an improvement 
in convergence rate is obtained without changing the residue noise 
power, the minimum mean-square error, and the convexity of the 
adjustments, and without complicating the gain-control adjustment 
loop. 


5.1 Residual Noise Power and Minimum Mean-Square Error 

As described in Section II, during data transmission the equalizer 
output is sampled sequentially at time-instants t = -+- ,t — T, te, 
tg + T, tg + 27, --+ , and the time-samples are used to recover the 
transmitted information digits. Each of these time-samples consists of 
a signal and a noise component. The variance of this noise component 
(that is, the residue noise power) can be determined for both the new 
and the conventional equalizer. 

Consider first the new equalizer (Fig. 4). Let the noise at the input 
of the equalizer be denoted by n(t). The resulting noise at the 
input of c; is denoted by y;(t), while the resulting noise at the 
equalizer output is denoted by v(t). Clearly 


w= LPalt-G-VT, t=1 to NO 
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and 


v(t) = De emi(d). (81) 


We assume that the noise n(t) is a zero-mean stationary Gaussian 
process. Then p;(t),2 = 1 to N, and y(t) are also zero-mean stationary 
Gaussian processes. Consequently, the variance of v(t,) 1s the same for 
all ¢t,. It is therefore sufficient to consider a single ¢, (for example, 
t; = 0). From (80) and (81) 


N 


x0) = Die 2 P.n{(1 — pT}. (82) 


i=1 
Let n be an N X 1 vector whose zth element is n[(1 — 2) 7]. It can 
be easily shown from (82) that 


v(0) = c’Pn. (83) 


The noise vector n is distributed normally with zero mean. Let the 
covariance matrix of n be denoted by A. Then »(0) is distributed as 
the normal distribution with zero mean and variance c’PAP’c. Since c 
can be optimized in the training period, we assume that during data 
transmission ¢ = ¢,,, . Thus, during data transmission the variance 
of »(0) is Var [v(0)] = c/,,PAP’c,,, . Substituting (60) into above gives 


Var [»(0)] = vA TAA’. (84) 
Note from (84) that Var[y(0) ] is independent of P. 


Next consider the conventional equalizer. It can be easily seen that 
when P = I the new equalizer in Fig. 4 is reduced to the conventional 
equalizer in Fig. 1. Since Var[v(0)] in (84) remains unchanged when 
P is set to I, Var[v(0)] of the new equalizer is equal to Var[v(0)] of 
the conventional equalizer. In other words, the use of the new equalizer 
does not change the residue noise power of the system. 

The minimum mean-square error, enin , has already been determined 
in the preceding sections. Comparing (12) with (75) shows that the 
minimum mean-square error does not change when the new equalizer 
is used instead of the conventional equalizer. 


5.2 Convexity of the Adjustment 


It has been shown’ that, when the conventional equalizer is used, 
e is a strict convex function of the gain controls c. This ensures that e 
converges for all initial settings of c, and that e« converges when the 
gradient method [equation (15)] is approximated by certain other 
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iterative techniques. In this subsection we show that e remains as a 
strict convex function of c when the new equalizer is used. 

Let denote the open convex set {¢c: —-27 <c; << ~,7 = 1toN}. 
Let h be an N X 1 vector whose ith element is h; . Clearly,” ¢ is a 
function of class c on the set &. If we can show that 


YY ha, 2 > 0 (85) 


for every ce K and every h # 0, then ™ ¢ is strictly convex on ®. 
It can be shown from (57) that 
de 
0c; 0c; 





= 2b;;, all ¢@ and j (86) 
where 6;; has been pe in (68). From (86) and (67), we have 


de 
= 1 hihi; 0c; OC; 





= 2h/PAP’h. (87) 


yy 


It can be seen from (87) that (85) holds for every c « ® and every 
h ~ 0 if and only if PAP’ is positive definite. Since P is a nonsingular 
matrix [see (74)], and A is positive definite, PAP’ is positive definite. 
Therefore, (85) holds for every ce K and every h ~ 0, and consequently 
e is strictly convex on &. This proves that « remains as a strict convex 
function of c when the new equalizer is used. 


6.3 Gain-Control Adjustment Loop 


In the case of the conventional equalizer, the gain controls ¢ are 
adjusted according to (15). The partial derivative d«/dc; used in (15) 
is obtained® by correlating the time-samples of the tap signal and 
the time-samples of the error signal y(t) — d(t). 

In the case of the new equalizer, the gain controls c are again 
adjusted according to (15). Since 


= > cizi(t), 
Oe 0 ~ 
ers . Do (yr) — aan 


_9 o3, (kT) |[y(kT) — d(kT)], 


=1 to N. (88) 
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It is seen from (88) that de/dc; can be obtained similarly by correlat- 
ing the time-samples of z,(t) and the time-samples of y(t) — d(t). 
Therefore, the gain-control adjustment loop of the new equalizer is 
essentially the same as that of the conventional equalizer. 


VI. APPLICATION OF THE NEW EQUALIZER STRUCTURE 


In this and the next sections, we consider how to use the new 
equalizer structure for a Class IV partial-response system. It will be 
shown, both analytically and by computer simulation, that when the 
amplitude characteristic of the transmission medium does not vary 
appreciably from channel to channel (such as in the case of multiparty 
private line polling systems), we can use a fixed P matrix for the new 
equalizer (that is, P;; need not be adjusted in each training period). 
Such a simplification is possible even though the system timing, the 
demodulating carrier phase, and the phase characteristic of the trans- 
mission medium change from channel to channel. 

It has been shown in Section IV that e« can be minimized in one 
adjustment if a, = 1 and P satisfies the equation 


PAP’ = I. (69) 


As pointed out in Section 3.1, the elements a, of the A matrix are 
independent of the demodulating carrier phase, the system timing, the 
phase characteristics of the transmitting and receiving filters, and the 
phase characteristic of the transmission medium. Therefore, from (69), 
P is independent of all these parameters. It can also be seen from 
(36) in Section 3.1 that aj depends only on the amplitude charac- 
teristic |F2(f)| of the transmission medium (amplitude distortions of 
the transmitting and receiving filters are usually negligible). In some 
systems, such as private-line polling systems, |F2(f)| does not vary 
appreciably from channel to channel. For such systems, a fixed P 
matrix may approximately satisfy (69) for all channels. Therefore, in 
designing such systems, we may estimate |F2(f)| and compute from 
this an estimate of A (this estimate of A will be denoted by S). We 
can then estimate P from the equation 


PSP’ = J. (89) 


and use this estimated P for all channels. When the estimated |F2(f)| 
agrees with the actual |Fo(f)|, S agrees with A. Consequently the 
estimated P satisfies (69), and « is minimized in one adjustment. When 
the estimated |F2(f)| differs from the actual |Fe(f)|, the estimated P 
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will not satisfy (69) and the convergence rate of « will be reduced. 
In order to see whether the reduced convergence rate is satisfactory, 
we need formulas to relate the convergence rate to |F2(f)|. Such 
formulas will be derived in this section, and will be used in the next 
section for a hypothetical data communication system. 
Let the difference between the estimate of A and A be denoted by 
R, that is, 
R=S-A. (90) 


Since S is an correlation matrix, it is symmetric. Therefore, R is also 
a symmetric matrix. 
From (66) and (90), we have 
ee, = [I —_ a,PSP’ + a,PRP’Je,_1 ) k = 1, a Pe (91) 
As in Section IV, we set 
a, = 1, all k. (92) 


Since the estimated P is computed from (89), we can substitute (89) 
and (92) into (91) to obtain 


e, = PRP’e,_; ) k= i 2, oe ae (93) 
It can be easily seen from the recursive equation (93) that 
e, = (PRP’)‘e, , k=1,2,---. (94) 


From (76), the mean-square error after the kth adjustment is 
€&, = €min + e/PAP’e, e (76) 


We now evaluate the last term e{/PAP’e, in the equation above. It 
can be shown from (89) that 


P’/P = S", (95) 
From (95), (94) can be rewritten as 
e, = PR(S''R)* 'P’e, , k=1,2---. (96) 
From (96) and (95), we obtain 
e/PAP’e, = e/P(RS"')*A(S'R)*P’e,, k= 1,2,---. (97) 


Using A = S — R and (89), we can rearrange (97) into the following 
form 


e/PAP’e, = e/P(RS”')?*(I — RS™)P”'e, , 


> 
I 


1,2,---. (98) 


NEW EQUALIZER FOR FAST START-UP 1995 


From (77), the initial value of mean-square error is 


€> = €min + e{PAP’e, . (77) 

It can be easily shown from (90) and (89) that 
e/PAP’e, = e/P[I — RS"]P'e, . (99) 
Since A is symmetric, PAP’ is symmetric. Let £, , f,°+: , Cy be the 
eigenvalues of PAP’, and let w;, we, :*: wy be a set of orthonormal 


eigenvectors of PAP’ (w; corresponds to ¢;). From 


PAP’w, = CW; (100) 
and [see (99) ] 
PAP’ = P{I — RS’]P™ (101) 
we have . 
RSP 'w,; = (1 — ¢,)P'w,;. (102) 


It is seen from (102) that the 7th eigenvalue of RS’ is (1 — ¢,), and 
the ith eigenvector of RS’ is P"'w; . We shall denote the ith eigenvalue 
of RS” by i; so 


ZV =1-f,, 7=1toN. (103) 


Now consider the matrix product P(RS~’)**(I — RS~')P™ in (98). 
To facilitate writing, we shall denote this product by ®, that is 


® = P(RS”)?*(I — RS7))P™. (104) 


From the eigenvalue and eigenvectors of RS~’, we see that the ith 
eigenvalue of ® is \3‘(1 — 4,), and the 7th eigenvector of ® is w; . 
We shall denote the 7th eigenvalue of ® by 6; , so 


6; = Wd — 4). (105) 


We have shown in the above that the orthonormal eigenvectors 
w, to Wy of PAP’ are also eigenvectors of ®. Hence, PAP’ and ® can 
be simultaneously diagonalized by w, to wy . By such diagonalizations 
we can reduce (99) to the form 


N 
e/PAP’e, = >» (1 — ii) (eiw,)” (106) 
i=1 
and reduce (98) to the form 


N 
e{PAP’e, = >> N*(1 — Ad(w,)’, %kb&k=1,2,-:. (107) 
t=1 
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It can be seen from (106) and (107) that the convergence rate of « 
depends on 4, to Ay . Let | A; | max denote the largest | i; | among | A, | 
to | Ay |, that is, 





max 2 1h, ls a — 1 to N. (108) 


| A: 


Since PAP’ is positive definite, its eigenvalues must all be positive. 
Hence, 





CS Le hy 0. ~=1toN. (109) 
From (106) to (109), the following bound is obtained 
e(PAP’e, < [Ai |male/PAP’e,, & =1,2,---. (110) 


Based on (110), we have the first method of estimating the con- 
vergence rate of e/PAP’e, : 


First Method. From each | F.(f) |, compute 4, to Ay . This gives 
|X; |max - According to (110), es,PAP’e, reduces by at least a factor of 
|X; | max] after each adjustment. This completes the estimation. 

This method will be illustrated in the next section. In certain ap- 
plications, measurements of | F.2(f) | may not be available. It may 
only be specified that | F.(f) | varies within certain bounds. In such 
cases, | A; |max cannot be determined. However, it can be bounded 
from the bounds of | F,(f) | . It is shown in Appendix B that 


1 — [n(P)Imox SA: S1—{nflmin, t= 1toN. (111) 








where 


oa (F.(f a. ‘BPE 
1) = (la = folonl’ 
As explained in Appendix B, |Fe(f — fc) lact is the actual value of 
lFo(f — fe)|, |\Fe(f — fe) est is the estimated value of |F2(f — f.)|, 
[y(f) ]max 1s the maximum value of 7(f) in the frequency range 0 = 
f S fe — fi, and [y(f)]mn is the minimum value of »(f) in the 
frequency range of OS f S fe — fi. 


Based on (110) to (112), we have the second method of estimating 
the convergence rate of e{PAP’e, : 


Second Method. From the bounds of | F.(f) |, determine [n(f)|max 
and [n(f)|min from (112). If 


| Las [n(f) |min | > | l= [n(f) Jmax |; 
use | 1 — [n(f)]min | a8 the upper bound of | 4, |max . If 


| Le [n(f)]max | > | | ee [nf lnin | ) 


OSS fe— tis (112) 
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use | 1 — [n(f)]max | aS the upper bound of | 4; |max . Use the upper 
bound of | A; |max a8 | A; |max to obtain from (110) an estimate of the 
minimum convergence rate of e{PAP’e, . 

As illustrated in the next section, the second method requires only 
some simple hand-calculations, and hence may be applied first even 
if measurements of | F.(f) | are available. However, it should be warned 
that the second method is usually too pessimistic because the upper 
bound of | 4, |max , obtained in the above fashion, can be several times 
larger than the actual | i; |max . (Consequently, the convergence rate 
may appear to be unsatisfactory when it is actually satisfactory.) 
Therefore, for borderline cases, the first method should be used. 

We illustrate these two methods in the next section. 


VII. NUMERICAL EXAMPLES AND COMPUTER SIMULATION 


In order to illustrate the methods we consider a hypothetical private- 
line data communication system (hereafter referred to as the 
hypothetical system). We assume that the system uses a Class IV 
partial-response signal and transmits over voiceband at a baud rate 
of 4800 bauds. The cutoff frequencies f, and f, defined in Section 3.1 
will be 400 and 2800 Hz respectively. The demodulating carrier fre- 
quency f, is equal to f, . It is assumed that the new equalizer in Tig. 4 
is used with N = 13 and a prefixed P matrix computed from (89) 
(see footnote in Section 7.1). The two methods will be used in Sections 
7.1 and 7.2, which follow, to estimate the convergence rate of e/PAP’e, 
{the convergence rate of the mean-square error ¢ is identical to the 
convergence rate of e{PAP’e, [see (76) in Section IV]}. A third and 
very elaborate method, computer simulation of the data communication 
system, will be used in Section 7.3 to demonstrate the convergence 
rate. 


7.1 Hstumation by the First Method 


Since | F.(f) | of private lines does not deviate considerably from 
a constant in the frequency band 400 to 2800 Hz, we shall simply use 
a constant as the estimate of | F.(f) | in this frequency band, and 
compute a P matrix accordingly from (89). This prefixed P matrix 
will be used for all lines. 

Now consider what happens to the convergence rate when the private 
line has a | F.(f) | as depicted in Fig. 5. It can be seen that this | F2(f) | 
varies from —4.8 dB to 0.4 dB, and then to —6.3 dB in the frequency 
band 400 to 2800 Hz. Such a variation is large for private lines. How- 
ever, we show that even for this large variation the prefixed P matrix 


1998 THH BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


|Fo(f)| IN DECIBELS 
f 3 


{ 
o 


oO 400 800 1200 1600 2000 2400 2800 3200 
FREQUENCY IN HERTZ 


Fig. 5—The |F.(f)| used in Section 7.1. 


can be used and the convergence rate of the mean-square error is 


satisfactory. 
In order to use (110), we compute the eigenvalues 4, to Ay in the 


following steps: 

Step 1. Compute the elements a;; of the A matrix from (36) and 
the | F.(f) | in Fig. 5. It is sufficient to compute only the first row of A. 
To see this, note from (6) that a,; satisfies the condition 

Q;; = 4;: (113) 
and the condition 
Giz = Auth) ith) « (114) 
From (113) and (114), we see that A can be written in the form 


ay Aye A13 C4 eis Qin 
Ae Qi Ai2 Q33 "es Qy(WN=1) 
Qi3 Aye Qi Are see Ay(N-2 
A= meen le 9145) 
Aig Q13 Aye Qi ce) Ay (N=3) 
Qin Gicn-1) Arcw-2) Arww-sy °° *" Qi 


where the elements on the diagonal line are all equal to aj; , and the 
elements on each off-diagonal line are equal (for example, the ele- 
ments on the second off-diagonal line are all equal to a13). Thus, it is 
sufficient to compute only the first row of A. 
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Step 2. Compute the S matrix. It can be seen from Section 3.1 
(from (37) to (40)) that, when 


| F.(f) | = a constant, KS FS fas (116) 
we have 
A = a,,A, (117) 
where 
i Oak © 0 
O Sh . Ge aad - 0 
—$F 0 1 0 - 0 
“= oe i A See Ol wo 
0 0 0 Oi, eres 
Since we have used a constant as the estimate of |F2(f)|, we have 
Saud: (119)* 


Step 3. Compute R = S — A, and compute the eigenvalues 4, 
to A\y of RS™’. 

The computations in the above three steps can best be carried out 
by a computer program. Such a program has been written. For the 
| F.(f) | in Fig. 5, the eigenvalues of RS~’ are found to be: —0.1573, 
—0.1434, —0.0902, —0.0615, —0.0034, 0.0242, 0.0716, 0.1084, 0.1834, 
0.1945, 0.2954, 0.3797, and 0.4651. The largest magnitude of these 
eigenvalues is 0.4651; therefore, 


|X; max = 0.4651. (120) 
Substituting (120) into (110) gives 


£17 i e{PAP’e, . (121) 


It can be seen from (121) that e/PAP’e, reduces at least 4.62 times 


e,PAP’e, < | u 


* From (119) and (89), we may determine the P matrix. Since P is not needed 
for the following computation, we do not carry out such calculations here. How- 
ever, it should be pointed out that the method from (69) to (74) may be used 
for such calculations. It should also be noted that by setting the G matrix in 
(74) to I, we can reduce most of the elements in P to zero. For example, when 
N ig a multiple of four, it is sufficient to implement only N2/8 of the Pi; sin 
Fig. 4. This greatly simplifies the implementation. 
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after each adjustment. In other words, each adjustment reduces the 
mean-square error by at least 6.65 dB. This convergence rate is satis- 
factory. For example, let en;, be 0.01 and let the initial mean-square 
error €) be as large as 4 (see computer simulation in Section 7.3). From 
(76), (77), and (121), we see that after four adjustments (approximately 
16 milliseconds) the mean-square error e reduces to less than 0.0187. 
Clearly, this convergence rate is satisfactory. 


7.2 Estimation by the Second Method 


When measurements of | F,(F) | are not available, we may estimate 
the convergence rate from the bounds of | F.(f) |. For illustrative 
purposes, let us assume that | F.(f) | does not deviate from unity by 
more than —2.5 dB or 1.5 dB, that is, 


~2.5 < 20 log, | F2(f)| < 1.5, 400 < f < 2800. 


Notice that | F.(f) | can vary in any manner within these bounds. 
We simply use unity as the estimate of | F.(f) |. Then from (112), 
[n(f)lmax = 1.414 and [n(f)]min = 0.562. Since |1 — [n(f)]min| > 
|1 — [n(f)|max |, we use | 1 — [n(f)]min | a8 the upper bound of | 4; |max 5 
that is, |, |max < 0.438. Using | 4; |max = 0.438, we obtain from (110) 


k 
e/PAP’e, < EA e/PAP’e, . 


Thus, e{PAP’e, reduces at least 5.2 times after each adjustment (a 7.16 
dB reduction per adjustment). Notice that since the actual value of 
|X; |max May be much less than 0.438, the actual convergence rate can 
be much faster. 





7.3 Computer Simulation of the Hypothetical System 

The analytical methods in the preceding subsections yield bounds 
of the convergence rate. In order to demonstrate the actual con- 
vergence rate, we simulate the hypothetical system on the computer, 
using a number of private-line characteristics obtained from field 
measurement. In order to conserve space we describe here only the most 
important results. Fig. 6 shows the convergence of the mean-square 
error for a private line which has severe amplitude and phase dis- 
tortions (see Table I). Other lines simulated have less severe distor- 
tions and hence faster convergence rates. A 30-dB signal-to-noise 
ratio is assumed at the equalizer input. The bottom curve shows the 
convergence of e when the new equalizer is used (with a prefixed P 
matrix computed from (119) and (89)). It can be seen that, when the 
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Fig. 6—Convergence of the mean-square error. 


TasLe [—RELATIVE ENVELOPE DELAY AND RELATIVE Loss CHARAC- 
TERISTICS OF PRIVATE LINE USED IN SECTION 7.3. 


Relative Envelope 





Frequency Delay Relative Loss 
(Hz) (us) (dB) 
300 5500 6.3 
500 2830 2.4 
600 2060 1.9 
800 1040 0.6 
1000 590 0 
1200 390 —0.9 
1400 280 —1.3 
1600 150 —0.8 
1800 0 0 
2000 —100 0.7 
2200 — 80 1.0 
2400 15 2.2 
2600 270 2.7 
2800 260 4.5 
3000 1500 7.1 
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new equalizer is used, « converges rapidly to its minimum value, and 
the training period may be terminated after the third adjustment 
(three adjustments require approximately 12 milliseconds). These 
simulation results are in full agreement with the theoretical results 
in the preceding subsections. 

The top curve in Fig. 6 shows the convergence of « when the con- 
ventional equalizer in Fig. 1 is used (also with N = 18). It can be 
seen that the convergence of « is rather slow, and that it takes more 
than 11 adjustments (approximately 44 milliseconds) to reduce « to 
close to its minimum value. 

Figure 6 is obtained with a specific setting of system timing and 
demodulating carrier phase. The computer program has also been 
executed for a sufficiently large number of other timing and carrier 
phase settings. Curves similar to those in Fig. 6 have been obtained. 
The results may be summarized in Fig. 7. The horizontal axis in Fig. 7 
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Fig. 7—Number of equalizer adjustments required to reduce the mean-square 
error to less than 0.05 for different timing and carrier-phase settings. 
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shows the setting of the timing and the carrier phase, while the 
vertical axis shows the number of adjustments required to reduce 
the mean-square error « to less than 0.05. For example, when timing 
is set to 0.17 and the carrier phase is set to 90 degrees, three adjust- 
ments are required to reduce « to less than 0.05 when the new equalizer 
is used, while 11 adjustments are required when the conventional 
equalizer is used. It can be seen from the top curve in Fig. 7 that when 
the conventional equalizer is used, the number of required adjustments 
varies a great deal with timing and carrier phase, and as many as 19 
adjustments can be required. On the other hand, when the new equalizer 
is used (bottom curve), the number of required adjustments remains 
small for all time and carricr-phase settings. (This is due to the first 
convergence of the new equalizer.) Thus, when the new equalizer is used, 
it is not necessary to make fine adjustments of timing and carrier 
phase at the beginning of the training period. We may simply set these 
parameters to some reasonable values, and then use the new equalizer 
to quickly reduce the mean-square error. It is possible that the whole 
process of adjusting the timing, the carrier phase, and the equalizer 
can be completed in a brief training period. 


VIII. SUMMARY AND CONCLUSIONS 


Section II considers amplitude modulation data communication sys- 
tems that use transversal filters for automatic equalization. The gain 
controls of the transversal filters are adjusted by the gradient method 
[equation (15)] to minimize the mean-square error between the 
received and the desired pulses. After the fundamentals are reviewed, 
the mean-square error is decomposed into N error-components (N is 
the number of taps on the transversal filter). The error-components 
depend on the eigenvalues \1 , -:: , Ay of the correlation matrix A. 
These eigenvalues and the convergence of the error-components 
depend on the signaling scheme and the channel characteristics. Two 
important classes of digital communication systems are examined: the 
class IV partial-response system and the SSB Nyquist system. It is 
shown that for both systems, the engenvalues d;, -*: , Ay are inde- 
pendent of the demodulation carrier phase, the system timing, the 
phase characteristic of the transmission medium, and the phase 
characteristics of the transmitting and receiving filters. Consequently, 
the eigenvalues depend on only the amplitude characteristics of the 
transmission medium and the filters. Since amplitude characteristics of 
the filters depend on the signaling scheme, it might be said that the 
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eigenvalues depend on only the signaling scheme and the amplitude 
characteristic |F2(f)| of the transmission medium. For applications 
where |F2(f)| does not vary considerably from channel to channel 
(such as private-line systems), the eigenvalues depend primarily on 
the signaling scheme. For the SSB Nyquist system, the eigenvalues 
1, *** , Ay are nearly equal. Consequently, as shown in Section 3.2, 
each adjustment reduces each of the error-components by a large 
factor, and the mean-square error can be minimized after only a few 
adjustments (for example, two adjustments). For the class IV partial- 
response system, the eigenvalues 41, -*: , Ay are very different in 
magnitude. (The ratio of the maximum eigenvalue to the minimum 
eigenvalue is very large; e.g., 40 or larger.) Therefore, each adjustment 
cannot reduce each error-component by a large factor, and, as can be 
seen from Figs. 6 and 7, the convergence rate may not be satisfactory 
for fast start-up purposes. 

The results in Section IIT suggest that in order to improve the con- 
vergence rate an attempt should be made to reduce the differences 
between the eignevalues. As emphasized in Section IV, differences in 
eigenvalues are caused by the correlation between the inputs to the 
gain controls. When the inputs to the gain controls are orthonormal, 
the eigenvalues are all equal and the mean-square error can be min- 
imized in only one adjustment. The inputs to the gain controls can be 
made orthonormal by using the generalized equalizer structure in Fig. 
3. The generalized equalizer can be realized with digital or analog 
filters. Because of the press toward digitalization and integrated cir- 
cults, digital filters are used. A realization using nonrecursive digital 
filters is depicted in Fig. 4. This equalizer structure, referred to as 
the new equalizer structure, is analyzed in detail. It is shown that for 
any given channel the coefficients of the digital filters can be set 
according to (69) or (74) to make the inputs to the gain controls 
orthonormal. Then each adjustment of the gain controls can reduce 
the mean-square error « to its minimum value, regardless of the mag- 
nitude of « prior to that adjustment. In Section V it is shown that such 
a fast convergence (one-step convergence) is obtained without chang- 
ing the residual noise power, the minimum mean-square error, and the 
convexity of the adjustments, and without complicating the gain- 
control adjustment loops. 

The theory in Sections IV and V is general in that it applies regard- 
less of the type of modulation (SSB, VSB, or DSB) or the signaling 
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scheme (partial-response or Nyquist). Sections VI and VII consider 
the application of the theory to a Class IV partial-response system. 
Most importantly, it is shown that the coefficients P;; of the digital 
filters in the new equalizer can be prefixed. It is first observed that 
for Class IV partial-response systems the coefficients P;; in the new 
equalizer depend only on the amplitude characteristic |F2(f)| of the 
transmission medium. Consequently, for private-line systems and sys- 
tems where |F2(f)| does not vary considerably from channel to channel 
(delay distortion can vary arbitrarily), we may compute the co- 
efficients P;; from an estimated (typical) |F2(f)| and use these pre- 
fixed P;; for all channels. One-step convergence is obtained when actual 
|F2(f)| agrees with the estimated |F2(f)|. When actual |F2(f)| differs 
from the estimated |F2(f)|, the convergence rate of the mean-square 
error is reduced. In order to determine if the reduced convergence rate 
is satisfactory, two analytical methods are developed to relate the 
convergence rate to |F2(f)|. These methods are first presented in 
Section VI, and are used in Section VII for a hypothetical data com- 
munication system. This hypothetical system uses a Class IV partial- 
response signal and transmits over private voice lines at a data rate 
of 4800 bauds. For private lines, |F2(f)| does not vary considerably in 
the passband 400 to 2800 Hz; therefore, a constant is used as 
the estimate of |F2(f)| and the prefixed P;; are computed from 
(89) and (119). It is shown analytically in Section 7.1 that the 
use of the prefixed P;; yields a satisfactory convergence rate even 
when the line has severe amplitude distortion. To further demon- 
strate the convergence rate, the hypothetical system is simulated on a 
digital computer, using the prefixed P;; and a number of private-line 
characteristics obtained from field measurements. A 30-dB signal-to- 
noise ratio at the equalizer input is assumed. Figs. 6 and 7 illustrate 
the convergence of the mean-square error for a private line that has 
severe amplitude and phase distortion. (The convergence is faster 
for other lines.) It can be seen from these figures that, when the new 
equalizer is used, the equalizer settles after three adjustments (ap- 
proximately-12 milliseconds). Furthermore, because of the fast con- 
vergence, the settling time remains small for all settings of timing and 
demodulating carrier phase. Thus, when the new equalizer is used, it is 
not necessary to make fine adjustments of timing and demodulating 
carrier phase in the start-up period. These parameters can simply be 
set to some reasonable values at the beginning of the start-up period. 
This can further reduce the overall start-up time of the system. 
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APPENDIX A 


A.1 Determination of Eigenvalues and Eigenvectors 


In this appendix we determine the eigenvalues and eigenvectors of 
the N x N matrix A in equation (40). The two cases of odd N and 
even N are considered separately. 


Case 1. Odd N. 


Let A be an eigenvalue of A, and let — 


u= (122) 


Un 
be the corresponding eigenvector. We can rewrite the equation A u = 
AU as 
N 
>. a;j;U; = Nu; ; a = 1 to N. (123) 


7=1 


Equation (123) can be split into the following two equations: 


N 
» a;;U; = NU; ) 1 = odd (124) 
j=1 
and 
N 
>) a;ju; = Mu; , 1 = even. (125) 


j=1 
From (89), aij; = 0 when i — jis odd. Thus (124) can be reduced to 
>> Q;jU; = NU, , a= 1,3,--: ,N. 
7=1,3,°°°,N 


Using (89), we can write the above equation as 
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1 —-3 0 0 O}} wy Ui 
—i 1 —i 0 of 0 U3 U3 
0 0 —i 1 ” 0 Uz U7 
0 0 0 0 2: 8s 1 UN. UN 


The first matrix on the left-hand side of (126) is an (N + 1)/2 X 
(N + 1)/2 tridiagonal matrix. In addition to the trivial solution 


U,=—= UW = Uu= one = Uy = 0, 
there are (N + 1)/2 nonzero solutions to (126). The kth such solu- 
tion, kK = 1,2,--- , (N + 1)/2, is* 
kr 











A=1- Wee Waly (127) 
2 
and 
U sin a 
, N+1 
para 
¢ k2ar 
Us sing 
Fe z ee (128) 
N+1 
got | : 
2 
NAL, 
Un -)) a 
N+1 
2 ce 
From (39), (125) can be written as 
1 —4 0 0 U2 U2 
—4 ] —% 0 Ua Us 
0 Us =X Us ° (129) 


oO 

| 
- ble 
—_ 


0 0 0 see DL] Ltw-ty |U(n-1) 
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where the first matrix on the left-hand side is an (VN — 1)/2 X 
(N — 1)/2 tridiagonal matrix. In addition to the trivial solution 
U2 = Ug = Up = °°? |= Ue) = 0, 


there are (N — 1)/2 nonzero solutions to (129). The kth such solu- 
tion, k = 1,2,-++ , (N — 1)/2, is 

















j=1- Pe ee (130) 
Ne ad 
2 
and 
U sin kee 
: N-1 
2 pia 
U sin kn 
4 — 
2 ee 
=|] 2 (131) 
eae a4, : 
k a i u T 
U(n-1) sin a a 
Me 1 44 


It can be shown that the A given by (127) is not equal to that given 
by (130). Therefore, (126) and (129) do not have the same nonzero 
solution. 

We have split (123) into (124) and (125), and rewritten (124) and 
(125), respectively, as (126) and (129). Thus, A and u satisfy (123) 
if and only if they satisfy both (126) and (129). Since (126) and 
(129) do not have the same nonzero solution, A and u are given either 
by the nonzero solution of (126) plus the trivial solution of (129), or 
by the nonzero solution of (129) plus the trivial solution of (126). 


The above may be summarized in the form of a lemma: 
Lemma A-1: For odd N, the eigenvalues and eigenvectors of A may be 


divided into two groups. The kth eigenvalue and eigenvector, k = 1, 2,--:, 
(N + 1)/2, in the first group are given by (127), (128), and 


U2 = Ug = Ug = 28 t = Uwn-1) = Q. 
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The kth eigenvalue and eigenvector, k = 1, 2, --- , (N — 1)/2, in the 
second group are given by (130), (131), and 
Uy = Uzy = Us = +: = Uy = 0. 


It can be shown that the eigenvectors specified in Lemma A-1 form 
a set of N orthonormal eigenvectors. 


Case 2. Hven N 


Using the same method as in Case 1, one can verify the following 
lemma: 


Lemma A-2: For even N, the eigenvalues and eigenvectors of A may 
be divided into two groups. The kth eigenvalue and eigenvector in the first 
group, k = 1, 2, --- , N/2, are given by 














\ = 1 — cos ¥. ; (132) 
2 
and 
U1 sin ec 
og tl 
Us 3] sin baat 
= li z a0 (133) 
oti 
N 
Uwn-1) sin ae 
oo 
and 
Ug = Us = Up = '*** = Un = 0. (134) 
The kth eigenvalue and eigenvector in the second group, k = 1, 2, --- , N/2, 
are given by 
kr 





\ = 1 — cos (135) 


oti 
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and 
uU sin her 
2 
N 
2 +1 
U sin kon 
4 
N 
3 2 
= ra gus (136) 
U sin bgt 
N 
N 
2 ae 4 
and 
Uy = Us = Us = ++ = UWw-1) = 0. (137) 


The eigenvectors given by (133) and (134) together with the 
eigenvectors given by (1386) and (1387) form a set of N orthonormal 
eigenvectors. 


APPENDIX B 


B.1 Bounds on Eigenvalues 

It has been defined in Section VI that \; ,7 = 1 to N, are the eigen- 
values of RS”*. In this appendix we study the relation between i; 
and the amplitude characteristic | F.(f) | of the transmission medium. 
It is easier to first study the relation between | F.(f) | and the eigen- - 
values of AS~’. Let X denote the eigenvalue of AS~’. Since AS" and 
S~*A have the same eigenvalues, X is also the eigenvalue of S"’A. Thus 


SAX = Xk, (138) 
where X is the eigenvector of S"*A. Premultiplying both sides of (138) 
with &’'S gives 

AX = hz’/SE. (139) 


Since S is a correlation matrix like A, S is positive definite. Consequently 
x/S& > 0 for every X except ¥ = 0. Since & is an eigenvector, X ¥ 0. 
Therefore 
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From (139), we have 


"AX 
’Sa 


par 


i= (140) 


| 


In this appendix, c is the vector defined in (4) for the conventional 
equalizer. Let [(c’Ac)/(c’Sc)]min denote the minimum value of (c’Ac)/ 
(c’/Sc) over all c ¥ 0, and let [(c’Ac)/(c’Sc)]n.. denote the maximum 
value of (c’Ac)/(c’Sc) over all c # 0. Then since the eigenvectors x 
are elements of the set {c | c ¥ 0}, we have 


| c’Ac | x Ax’ | c’Ac | 
petBlt < < 
ce’ Sclmin  X SX — Le’ScIimax (141) 


Now we evaluate the two bounds in (141). Using the notations in 
Section II, we can easily show that 








c’Ac= >> yj. (142) 


i=—00 


Since y; are time samples of y(t) taken at the Nyquist rate, we have 
by sampling theorem 


DL v= 2h-f) | wor ae. (143) 
According to Parseval’s theorem 


[ wora= [vor a, (144) 


where Y(f) is the Fourier transform of y(t), and Y*(f) is the com- 
plex conjugate of Y(f). From (2), we obtain 


Y(f) = X(fe"**"* Dae (145) 
Substituting (145) into (144) gives 
[wor a= [ xor | Secret || See] a. cso 
One can rearrange (146) into the following form 


f . YOY dt = 2 i “IXOP af af, (147) 


where 
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a) = | > C, COS anf | ois e c, sin auf | (148) 


Since we are considering a Class IV partial-response system, we can 
use (30) in Section 3.1 to determine |X (f)|, and use (35) for the 
product lFi(f — f-)Fs(f — f-)| in |X (f)|. From these equations, (147) 
can be reduced to the following form 





fa-f 2 
[wora=5 [0 ing - tor [sin z L) a(f) df. (149) 
Combining (142), (148), and (149) gives 





plata af 2 
c’Ac = (f2 — fi) i lFo(f — f2)/’ [sin 7 = - | q(f) df. (150) 


In the following discussion, we have to distinguish between the actual 
|Fo(f ~ fe)| (denoted |Fo(f — fc) act) and the estimated |F2(f — fe)| 
(denoted |F2(f — fc) lest). In order to emphasize that the |F2(f — fe) | 
in (150) is the actual |Fo(f — f.)|, we rewrite (150) as 





ce’Ac = (f2 — fi) if i [|Fo(f — {lee sin i - | q(f) df. (11) 


When the actual |F2(f — f-)| is replaced by the estimated |\Fo(f — fe) |, 
A is replaced by S, and (151) becomes 





fafa 2 
ee = 1) [LPG ~ 1leed"[ sin LF at ay. 052) 
Let »(f) be defined as — 


= [|F¢f = fe lec: ( 
we =a 
From (151), (152), and (153), we obtain 





Ss fo a fi . (153) 


[ n99@ 
th Q(f) df 
cAc = Vinee Coke ee (154) 





ec Sc fa-fi 
[  9@ a 
where . 
90) = [LG — fOlea'| sin A fac. (155) 
It can be easily seen from (155) and (148) that 
A) 20, OSfSh—h. (156) 
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Furthermore, for c ~ 0, Q(f) cannot be identically 0 in the frequency. 
range 0 = f S fo — fi (for if so c’Sc would be 0, contradicting the 
fact that S is positive definite). It can be seen from (153) that 


a(f) = 9, OS 7S h— tis (157) 


From these properties of Q(f) and (f), one can see that 


[aq af 
rr eee ere (158) 
[oar 


where [7(f) ]max is the maximum value of 7(f) in the frequency range 
0=f =fe — fi. It can alse be seen that 


[1990 a 
[90 a5 


where [»(f) ]min is the minimum value of »(f) in the frequency range 
0=f S fe — f,. Now we tie the results together. From (140), (141), 
(154), and (158), we have 


2 Inf) lmin ? (159) 











~ x’ AX c’Ac 
A= x’ Sx < [eae] = [n(f) max . (160) 
Similarly, from (140), (141), (154), and (159), 
- AK _ [c’Ac | 
A= x’ Sk = [eae] =a [n(f) min . (161) 
We may combine (160) and (161) as 
[n(f)lmin SAX S [n(f) max - (162) 


It can be easily shown that the eigenvalue \; of RS~ is related to the 
eigenvalue X\ of AS"* by 


From (163) and (162), we obtain the final result 


1 — [n(f)]lmx SA; S1—([nG)Imin, t=1 to N. (164) 


Equations (164) and (153) are, respectively, (111) and (112) in 
Section VI. : 
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Combining Correlated Streams of 
Nonrandom ‘Traffic 


By SCOTTY NEAL 
(Manuscript received October 6, 1970) 


The Equivalent Random method 1s used for engineering many of 
the telephone overflow-networks in the Bell System. But since this 
method 1s not directly applicable to the analysis of graded-multiple 
trunk-groups which carry overflow traffic, we extend the method to 
cover such arrangements. The key to this extension is a technique for 
taking correlation into account when combining dependent streams 
of traffic which are themselves more variable than Potsson. In prin- 
ciple, the technique is applicable wherever a stream of overflow traffic 
ws divided, submitted to independent trunk groups, and then recom- 
bined. 

The extended Equivalent Random method provides adequate estt- 
mates of load-service relations for graded multiples which carry 
overflow traffic, provided the grading capacity is not substantially in- 
fluenced by the network that precedes the grading. 


I. INTRODUCTION 


The Equivalent Random method’? is used for engineering many 
of the telephone overflow-networks in the Bell System. However, the 
method is not directly applicable to the analysis of graded-multiple 
trunk-groups' which carry overflow traffic; e.g., gradings used as 
alternate routes in step-by-step switching systems having common 
control and alternate-routing capability. Apparently, Lotze®® is the 
only author with results for estimating load-loss relations for gradings 


+ The reader should have some knowledge of graded multiples and the methods 
associated with the engineering of telephone overflow-networks. Some familiarity 
with the step-by-step switching system would also be helpful. Those not ac- 
quainted with these concepts may find it worthwhile to consult Ref. 1 for a dis- 
cussion of telephone overflow-networks. An introduction to graded multiples is 
given in Ref. 3. Reference 4 contains a description of the pertinent aspects of the 
step-by-step system. 


2015 
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which carry overflow traffic. Unfortunately, his method requires data 
which cannot be obtained in a step-by-step system at reasonable cost. 

We extend the Equivalent Random method to cover such applica- 
tions. The key to the extension is a technique for taking correlation 
into account when combining dependent streams of overflow traffic. 
(In principle, the technique is applicable wherever a stream of over- 
flow traffic is divided, submitted to independent trunk groups, and 
then recombined.) In Section II, we derive the appropriate covariance 
function. In order to describe how the covariance function is used to 
extend the Equivalent Random method,* we begin with an example 
of an application of the extended method for the analysis of a step- 
by-step graded multiple. 

Figure 1 represents schematically a step-by-step grading which 
might be used as a final route. Each horizontal bar denotes one trunk 
(server). The traffic offered to the grading? is an overflow stream 
from a subordinate network. The traffic is represented by the mean 
a and variance v of the number of simultaneous calls that would be 
in progress if this traffic were carried on a full-access group without 
blocking. The diagram is designed to indicate that an arriving call is 
first directed at random to one of the four first-choice subgroups; 
i.e., an arrival is directed to the ith subgroup with probability 7;. 
After reaching a particular subgroup, the call hunts vertically upward 
for an idle trunk. If all three trunks in the subgroup are busy, the 
call overflows into the second major level of the grading. The call 
then seizes the lowest idle trunk in the second-level subgroup. If both 
trunks in the second level are busy, the call overflows to the third 
major level containing five trunks (usually called finals). If all five 
finals are busy, the call leaves the system and does not return; i.e., the 
call is blocked and cleared. The symbol ap and vo denote respectively 
the mean and variance of the overflow from the grading. 

In a step-by-step system, the arrangement of line-finders and 
selectors through which calls reach a grading causes an inherent load- 
balancing‘ over the first-choice subgroups; there is positive correlation 
between the numbers of occupied trunks in the individual subgroups. 
Attempting to introduce the correlation into our model, we assume 


+The Equivalent Random method is known to be adequate for estimating 
load-service relation for overflow-networks having Poisson input.’ 

+ For the present study, we assume that all offered loads are ee Hence, 
we are estimating single-hour capacities of gradings. Utilization of our results 
for normal engineering involving average busy-hour loads, would require periph- 
eral operations to adjust the load-service relationship to reflect the effects of low, 
medium, or high day-to-day variations in the load.® 
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Ca 5%) STREAM OF 
BLOCKED CALLS 


FINALS 


SECOND-LEVEL 
TRUNKS 


FIRST — CHOICE 
SUBGROUPS 


STREAM OF OFFERED 
CALLS HAVING MEAN 
(a,v) a AND VARIANCE V 


Fig. 1—Schematic representation of a step-by-step graded multiple containing 
21 trunks. 


the configuration given in Fig. 2. The four arrows above the full- 
access group denote (correlated) overflow streams caused by the 
corresponding input streams. The intensity of the 7th input stream 
1S Q; = pia. 

At this stage, we have modeled an arrival process having mean a 
and variance v. The individual substreams to the first-choice sub- 
groups of the grading are certainly correlated. How well the correla- 


(49,Vo) 
THE SUPERPOSITION 
f f OF THESE FOUR 
04 5Vj Qo 5V2 03 V3 Og V4 STREAMS HAS MEAN 
a@ AND VARIANCE V 
° C TRUNKS 


Fig. 2—A model for correlated input streams. 
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tion approximates the correlation that exists in an actual step-by-step 
system is a point which must be tested. In Section III, we show that 
the approximation works quite well when a large group of selectors 
is connected to the grading. In any event, we can view the above 
figure as a first approximation for the step-by-step problem. More- 
over, the general features of the above configuration are not restricted 
to step-by-step systems. 

Figure 2 also illustrates the two basic features of the Equivalent 
Random method. First, the method assumes that, for engineering 
purposes, only the first two moments of an overflow process are 
required. Second, the method assumes that any overflow process having 
mean a and variance v is adequately approximated by the overflow 
from a unique “equivalent system” consisting of a full-access trunk- 
group with Poisson input. Whenever a and v are known, standard 
techniques are available’ to obtain the equivalent system of c trunks 
and intensity a = a, + dy + a3 + ay of the Poisson input. 

The next step consists of using results obtained independently by 
Descloux!® and Lotze® to determine the mean q; and the variance v; 
of the overflow (due to a;) which is submitted to the ith first-choice 
subgroup of our grading. This “splitting” is determined by? 


a; = pia (1) 


and 


v0; 


v 
The covariance c;; between the ith and jth “split” streams is given by?° 

C3 = DiDiV — @). — @) 
After splitting, our system is represented as 


Cag 2%) 


— 


“Il 


“Il 
“Il 


a4. 2 V2 @3 »V3 a4 >a 


Fig. 3—Graded multiple with correlated input streams. 
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where it is understood that the parameters a; and v; of the individual 
substreams satisfy equations (1) through (8). Using the Equivalent 
Random method, we now approximate the individual substreams as 
overflows from full-access trunk groups. Moreover, to determine the 
total overflow which goes to the second major level of the grading, 
we view the entire system in the following manner: 


t ! f t 


Cay Vid Cay ,Via) (a43 ,Vi3) Ca14 Via) 








I f f 
Cay Vj) (az V2) (a3, V3) Cag Va) 


ew-{ : cre-{ : : Fe : 


ay Ai2 843 








Fig. 4—First cycle of application of the extended equivalent random method. 


Figure 4 indicates that an overflow of mean a; and variance v; re- 
sults from a Poisson stream of intensity a,; being offered to a full- 
access group of c,; trunks, 2 = 1, -:-, 4. Furthermore, the 7th sub- 
stream is offered to the three trunks in the 7th first-choice subgroup 
of the grading, and causes an overflow of mean a; and variance v4; 
to be submitted to the second major level of our grading. 


One can see the main reason for looking at the grading in the manner 
described above: the parameters a,; and 1; are easily computed by’ 


ai = 1: ,¢, +3(di:) 
and 


i;= cai ai (1, + 3) ta; — a; + i| 


2020 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


where H,,,(a) denotes the first Erlang loss-function (Erlang-B block- 
ing probability). 

To complete the first cycle of computation, we need to obtain the 
mean & and variance o of the total overflow submitted to the second 
major level of our grading. The mean @ is given by @ = 04_, a1; . 
Unfortunately, the variance ¢ is more difficult to obtain, since the 
individual substreams are correlated. . 

To obtain 3 we need the covariance, cov (2, j), between the ith and 
jth overflow streams which are offered to the second level. A reasonable 
method for computing cov (7, 7) has not been available in the past. 
Our extension of the Equivalent Random method consists of an algo- 
rithm for computing cov (7, 7) in a fairly efficient fashion for many 
configurations of interest. A derivation of the algorithm is given in 
Section II. 

After cov (7, 7) has been determined, @ is obtained from 


4 4 

= D1: Ae » cov (2, j). (4) 
t=1 peor 

Having & and #, we reduce the system configuration to that shown 

in Fig. 5. 


Ca ,V%) 


1 


(@,V) 


ne: 5—Starting point for the second cycle of the extended equivalent random 
method. 


The proportion of the traffic (@, 0) offered to the first subgroup of the 
second level is 6; = (ai. + a@i2)/&, and po = (aig + a14)/& is the pro- 
portion offered to the second subgroup. Consequently, one cycle of 
computation is completed. 

Repetition of the logic described above will yield estimates of the 
overflow mean a, and variance v) . Moreover, the cyclic nature of the 
procedure allows the logic to be programmed on a digital computer 
so that load-service tables and other relevant information can be 
enerated in a straightforward manner. 
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II. MATHEMATICAL MODEL 


A service system S is composed of a collection 9% of c first-choice 
servers, two groups 9, , 2. containing d, , d, second-choice servers 
respectively, and two last-choice groups £; , £2 each containing an 
infinite number of servers (see Fig. 6). The arrivals into the system 
are generated by »v independent groups of customers G,, --- , G,. The 
arrivals from group G; occur according to a Poisson process with in- 
tensity a;. All service times throughout the system are independent 
and have a negative-exponential distribution with unit mean. 

A customer, arriving to find an idle first-choice server, selects an 
idle server from 91, and service commences immediately. If an arrival 
from group G;, 7 = 1, 2, occurs when all c of the first-choice servers 
are busy, but at least one of the d; servers from the second-choice 
group 9; is idle, a server is selected from 9; , and service commences 
at once. If a customer from group G; arrives to find all the servers 
busy in both 9% and 9; , then he is served by one of the servers from 
the group £;. If k # 1 and k ¥ 2, requests for service from group 
G, , which occur when all c servers in SW are busy, are dismissed and 
do not return. 

We assume that the system is in statistical equilibrium and define 
M, N;, L; to be the number of busy servers in SW, 2; and £; respec- 
tively (at a random instant of time) for i = 1, 2. Define the state of 
the system to be (IM, N,, N2, L,, Le) with joint probability density 
function f(m, m1, 22,4, 12) = 

P{M=m,N,i,=m,No=m,k, =h, De = |b}. 


LAST - CHOICE 
GROUPS 
SECOND ~ CHOICE 


FIRST-CHOICE 
GROUP M 


ay Q2 a3 ay 


Fig. 6—System configuration. 
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Settinga = a, +a, + --- +4,, it follows that f must satisfy relations 
of the following form: . 


For0SmSc-1,0S5n, Sd, andl, 2 0, 


G@tmtn tm +h + b)f(m,m ,m ,h , b) 
= af(m —1,m , 22, , b) 
+ (m+ If(m + 1,m ,n,h , bk) 
Se EA Gn, ae pty UD) 
+ (te + Dil, m 5m +1, 4,4) 
+ (L, + Df(m, nm , ne, 1h + 1, L) 
+ (lp + Df(m,m ym ,h , b + 1). (5) 


Similar relations hold on the boundary of the state space {(m, m1, ne, 
Lib): 0OSmS605n, Sd,, 1, = 0}. Wedentie (45 lezero at-all 
points not in the state space. 

The preceding infinite set of equations is quite difficult to solve. 
However, when it suffices to know the various moments of the random 
variables L, and Lz, the problem can be simplified by introducing a 
two-dimensional binomial-moment generating- cn This function 
is defined by 


Bim, ns » Ne 7X » Xo) 


7 ye > f(m,m ,m2,4,L)a+2)°A +2)" ©) 


i1= 0 l2=0 


for -12%20,05m£6, and 0 <n < d. Assuming that the 
binomial moments 


oO 


Bi, .1,(m, ny , N2) ome yy pz) \ ‘ it fim, M ,N2 k, » kz) (7) 
nee L) Ul, 


exist, it follows thatt 


ie) 


e 1 l 
Bim, Ny Ne 5% »X2) = “* De > B,,, 1,(m, Ny ) No) ty'% 2 
1,=0 lg= 
Of course, the binomial moments B,,,,,(m, 1 , m2) are the entities 
of interest since . 
t Various manipulations of these double series will be carried out in the sequel. 


The mathematical justification for the validity of the manipulations can be 
obtained from Ref. 11, Sections 5.3 through 5.5. 
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Aislin 


In particular, Bao) = E (Ly), Bo,1) = E (La), and Byray = E (L,L2) 
so that cov (Ly ; La) = Biaay = Bio) Bro) 3 

Relations for the binomial moments are obtained by multiplying 
both sides of Equation (5) (and the boundary equations which were 
not given) by (1 + z,)'*(1 + 2)'* and summing on J, and J, . Equating 
like powers of x, and x, yields the following finite system of equations: 


fOsmsc-—1landOSn;Sd,;, 
(atm+tn, tne t+ l, + &)Bi,.1,(m, mm, , ne) 
= aB,,.,(m — 1,7, , m2) + (m + 1B: .1,(m + 1, n, , n2) 
+ (n, + 1)B,,.1,(m, n, + 1, ne) 
+ (mz + 1)Bi,,1,(m, nm, , 2 + 1). (9) 
For 0 S$ n; S d; — 1, 
(a ta,tetn tn. +h + b)Bi1.6,m Mm) 
= aB,,.1,(¢ — 1, m , ne) 
+ a,B,,.1,6€,m — 1, 22) 4+ 2B: .1,6,m , me — 1) 
+ (nm, + 1)Bi,.1.6, m + 1, ne) 
+ (m2 + 1I)Bi..€, 1 , M2 + 1). (10) 
Whenever 0 S m S d, — 1, 
(a, te +m + dz + 1, + 1)Bi,.1,(€, 1m , de) 
= aB,,1,(¢ — 1, , do) + a,Bi,,1,(¢,m — 1, de) 
+ a,B,,.1,, 7, ,d2 — 1) 
+ (n, + 1)B,,.1,€, m + 1, d2) + a2Bi,.1,-1€, m , de). (11) 


A similar result holds for 0 S ng S dy, — 1. At the extreme boundary 
point (c, dy ) dz), 


(sd Fb de he) Bi, a (6 da 5 da) 
= aB,,1,(¢ — 1, dy , de) + a,Bi,,1,(€, dy — 1, de) 
+ a.B,,.1,, di , dz — 1) 
+ 4,B,,-1.1,€, d; , d2) + a2B1,.1,-:(¢, di , de). (12) 


m=0 ni=0 n2a=0 


c dy dz 
Bais yO. Cal ie) = ol 
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Since Byy (m,n, M2) = P{M = m, Ni = n1, N2 = ng}, it follows 
that 


e dy da 


»s a Ss Bo o(m, m , m2) = 1. (13) 


m=0 ni=0 ng=0 


We set B,,..,(m, m1, n2) = 0 for any point (J, , 12 , m, mn, , Nz) not in 
the set 


{li,l,,m,m,n): 1. 20, OS mSc, OS7n; S d;}. 


A very useful relation can be obtained by summing all of equations 
(9) through (12) to obtain 


da 
(1, aE ls)Ba,.ta) = Ay > Bi-1,1,€, dy , M2) 


n2=0 


dy 
+ a2 >> Bizt,-1(€, m , de). (14) 


ni =0 


Consequently, Buia) = H{LZ,L2} can be obtained from 


Qi(m, NX) = by By o(m, m ; Ne) (15) 
and 
ds 
q2(m, n) + pa Bo(m, ny, , Ne). (16) 
That is, 
2Bor.1) = Aige(c, dy) + agile, de). (17) 


Relations for qi and q» are obtained directly from equations (9) 
through (12) (see Appendix B). However, the relations require 
Boole, m1, dz) and Boo(c, di, n2) forO Sn; = d;. (See Refs. 12 and 
13 for a related problem.) Setting l, = l, = 0 in equations (9) through 
(13) yields, with equation (14), a system of (c + 1) (d; + 1) (dz + 1) 
independent linear equations for By. which in principle can be solved 
numerically. Unfortunately, for the step-by-step applications de- 
scribed in the introduction, c can be quite large (20 or more) although 
d, and d, normally do not exceed 5. Consequently, systems of 500 
and more equations would not be uncommon. Since a solution might 
be required for several sets of parameters in any particular network, 
a direct numerical solution is not attractive. 

In Appendix A, we obtain a closed-form expression for Boo (m, m , 
nz) in terms of the (d; + 1) (dz + 1) constants 


CORRELATED TRAFFIC-STREAMS 2025 


maT 
M1) L Ne 


c 
Furthermore, these constants satisfy (dy + 1) (d, + 1) independent 
linear relations, so that a numerical solution is quite practical. In fact, 
the maximum size of the system of equations requiring solution for 
step-by-step systems is reduced from more than 500 to 36. 

A closed-form expression for q,(c, d2) and qz(c, dy) is derived in 
Appendix B. These results combine into the following computational 
algorithm for cov (L, , Le): First of all, 


B(0, 0) = E;,.(a) (19) 
(the Erlang-B blocking probability for the first-choice group), and 
for0 Sn Sdi,m + nz > 0, 


IIA 


n, <d;. (18) 








(ny + N2)¥ni+nB(M1 Ne) 
= a,B(n, = 1, nz) + a8 (ny 7 Ne — 1) 
= | a; ja ,N2) — “| ds a , de). (20) 
nm — 1 N2 — 1 
By definition, B(n1, — 1) = B(— 1, m) = 0 for0 Sn, = d;. The 


numbers vy, are intimately related to various aspects of overflow sys- 
tems! and satisfy the following recurrence relation: 


* = By..(a) 21) 


and 





od 





(22) 


V, 
: ~ Wy 1 


Then, (from Appendix B) 


Float | 








nc, dy) = 2 dat+l1 a (23) 
Elo | Be] 
j = 1 k=7+1 k 
and 
2 |G, a) TI 
a 7 (24) 


qc, d,) = rer a : 
ELS ana 
7=1 j- 1 k=j+1 k 
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From Appendix A, 


E{Iy} = a,B(d, ) 0) and E{L,} = a6(0, ds), (25) 
so that 


2 cov (Ly , Le) = ayqo(c, d:) + aoqi(c, de) — 2a:0,8(d; , 0)8(0, dz). (26) 


Equations (19) through (26) constitute an algorithm for the com- 
putation of cov(L, , Le). 
Whenever 


a, = Ae and d, = d. 


the symmetry of the problem (see Fig. 1 and equation (18)) implies 
that 


B(m , M2) = B(ne , m1). (27) 
The symmetry required by (27) would prevail for most step-by-step 
graded multiples. In such cases (27) can be used to reduce the number 
of equations to 4% (d,; + 1) (d, + 2). Consequently, the dimensions 
of the systems of equations needed for an analysis of most step-by-step 
graded multiples would not exceed 21. Such systems can be solved 
very efficiently by numerical matrix inversion. 


III. NUMERICAL RESULTS 


In order to establish a base for comparison, we used a simulation 
to obtain load-service relations for a 25-trunk and a 45-trunk step- 
by-step graded multiple. We obtained results for several values of 
variance-to-mean ratio z = v/a and several selector configurations. 
We found that the extended Equivalent Random method furnished 
adequate estimates of blocking probability in each case where the 
maximum number of selectors was used. However, as the number of 
selectors was reduced for a particular grading, the inherent load- 
balancing* caused actual grading capacity to be higher than indicated 
by the extended Equivalent Random method. Consequently, we con- 
clude that the extended Equivalent Random method provides adequate 
estimates of load-service relations for graded multiples which carry 
overflow traffic, provided that the network through which calls reach 
the grading does not significantly influence grading capacity. 

In view of the effort required to obtain the covariance cov (JL; , L;), 
one naturally questions the necessity of accounting for the correlation 
which occurs in our problem. In fact, on several occasions, we en- 
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countered the following question: What sort of results would be 
obtained if both mean and variance were split by proportion (i.e., 
a = pia and v; = pw) when splitting is required, and cov(L;, L;) 
were assumed to be zero whenever a variance recombination is re- 
quired? 

In order to consider the preceding question, as well as to obtain a 
better understanding of the behavior of cov(L;, L;), three computer 
programs were written to generate load-loss relations for step-by-step 
graded multiples. These programs were based respectively on the 
following assumptions: : 


(t) The input traffic is completely random, i.e., Poisson.‘ 

(47) The input traffic is nonrandom. Mean and variance are split 
by proportion when required and cov (L; , A) is assumed to 
be zero.? 

(7t) The input traffic is nonrandom, and the gradings are analyzed 
by using the extended Equivalent Random method. 


Throughout the study, the offered traffic was assumed to be balanced 
over the subgroups of any grading under consideration (i.e., Pi =. p; 
for all 2, 7). 

For comparison, we used each of the three assumptions to compute 
load-service relations for a 25-trunk and a 45-trunk graded multiple. 
The results are displayed in Fig. 7. For these examples, the variance- 
to-mean ratio of the nonrandom offered traffic was: held constant 
at 2.25. : : 

The different results arising from assumptions (2) and (222) were 
surprising. It was originally felt by some that assumption (i) would 
cause over-trunking, but not by the amounts indicated. For example, 
notice that the 45-trunk. grading yields a B.01 blocking probability 
for an offered load of 330 ccs (9.17 erlangs) with a variance-to-mean 
ratio of 2.25 when the correlation is neglected as outlined in assump- 
tion (2). However, Fig. 7 indicates that the same traffic can actually 
be handled at B.01 on the 25-trunk grading when the correlation is 
taken into account. Hence, assumption (iz) leads to at least 80 percent 
overtrunking for the example. An examination of other portions of 
the curves yields similar results. conecicny sesunpnon (27) must 
be discarded. : 

Lower bounds to the load-loss relations for nonrandom traffic 


+ Assumption (2) is known to cause an underprovision of trunks.t 
* Assumption (wz) was considered by many to be the most natural approach 
to improving on assumption (2). 
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Fig. 7—Load-loss relations for the 25-trunk and the 45-trunk graded multiples. 
Variance-to-mean ratio of the offered load is z = 2.25. 


result from assumption (7) as illustrated in Fig. 7. For these examples, 
undertrunking by 18 to 25 percent results from approximation (2). 
Since the disparity will increase for larger variance-to-mean ratios, 
assumption (?) does not seem applicable either. 

Two other approximations were also tried but the results were very 
poor. The first used the correct splitting equations (1) through (3) 
but assumed cov(L;, L;) = 0. As a result, some of the load-service 
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curves actually intersected each other. The second also used the 
correct splitting formulas but assumed the correlation coefficient. 


cov (L; , L;) 
[var (L,) var (L,)]* 
to be constant at recombination points and equal to the correlation 
coefficient for the split (nonrandom) offered traffic. The predicted 
blocking resulting from the last approximation actually decreased as 
the intensity of the offered traffic increased. 


pL; ) L;) = 


IV. CONCLUSIONS 


We have presented a technique for taking correlation into account 
when combining certain dependent streams of overflow traffic. The 
result was used to define an extension of the Equivalent Random 
method; an engineering approximation for estimating the capacities 
of overflow networks. 

The extended Equivalent Random method yields good estimates 
of load-service relations for graded multiples which carry overflow 
traffic provided the network through which calls reach the grading 
does not significantly affect grading capacity. Consequently, for ap- 
plication to step-by-step gradings, the extended method is restricted 
to gradings which are connected to large groups of selectors. We are 
currently investigating techniques which would allow us to consider 
systems which do influence grading capacity. 
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APPENDIX A 


System State Probabilities 


In this section, we obtain the solution for the system of equations 
(9) through (13). We use a generalization of a technique employed 
by Kosten.** 
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For notational simplicity, let p(m, n:, n2) = Boo (m, m1, N2), and 
let p: (m, m1, N2) denote any function which belongs to the family @ 
of functions satisfying equation (9) for0 < m < » andO Sn; Sd,. 
Define 


oc d, dy 
P(t, i) G2) = dy Dy D pilm, ny, , n2)t" Gig’. (28) 


m=0 ni=0 n2=0 


It follows from equation (9) that 


a-y#+a- a) 5, + = a) $= all — OP. (29) 


This linear first-order partial differential equation can be solved by 
(Lagrange’s) method of characteristics (see Ref. 15, Chap. 2). The 
characteristic equations are 


with solutions 

1-— 

1 — “ 
where k and k; are arbitrary constants. Hence, the solution to equation 
(29) is given by 


=k, and k=e“P 





a Ls 
Pt, ",) =e ‘W(L—4 qe 14) (30) 


where H(z,, 22) denotes any analytic function of the arguments 
Z1, 22. From (28), it follows that the Taylor series for H must be 
finite, and so 


Pl, na) =e" SH alm ng(—2) (LS 2)". wn 


=0 n2=0 





Following the notation of Ristdan (Ref. 1, p. 89), let {o,(m): 
m = 0, 1, +--} be the sequence with generating function 
(1 — t)-* anil that is, 


x.) at 


Lal)" = Gaye (32) 


The variables o;(m) satisfy the following recurrence relations.1"4 
FormZ0Oandk2Z QO, 


mo,(m) = ao,(m — 1) + kox.,(m — 1), (33) 
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Kozei(m) = (k + m — ajo,(m) + ao,_,(m), (34) 
and 
YX eum) = onli). (35) 


It is convenient to define 
o_,(m) = o,(—1) =): (36) 
Hence, from Equations (31) and (82), 
P(t, q 5 4) 


ao dy dg 


DD DY aly , ma)onsn(mt"1 — wa)" — @)* ~— (37) 


m=0 ni =0 na=0 


sys [ioe aps alae , ks) 
NJ Ne 


m=0 21-0 na=0 ki=n, ka=na 


I 


ona Pata (38) 


Comparing equations (29) and (88), one can see that the functions 
pi in @ are of the form 


pi(m, ny Ns) = (pe > Ss i 


ki=ni ke=ne 
1 Ny 





ac sKe)on+us(m), (39) 
Ne 





and are determined up to the (d, + 1) (d. + 1) constants {a(m, nz): 
0 Sn; S d;}. There are (d, + 1) (d, + 1) independent linear equations 
among equations (10) through (13), and so it follows that there is 
exactly one function, p*, in @ which satisfies equations (9) through 
(13). The appropriate restriction of p* must be p. The remainder of 
this section is devoted to a derivation of the relations which the con- 
stants {a(n , N2)} must satisfy in order to obtain the solution. 

An equivalent but less complex set of boundary conditions is ob- 
tained by putting m = c in (9) and subtracting equations (2) through 
(13) respectively. Hence, if OS nn, =d; — 1, 


(a — a, — G)p, mi , M2) + ape, m — 1, m2) + asp, m , nm. — 1) 
=(¢+ 1)p(c + 1, , 2), (40) 


forO0=nm=d, —1, 
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(a — a,)pC,m , d2) + ape,m — 1, d2) + ape,m , dz — 1) 
= (+ l)pe+1,n, , d), (41) 
and a similar relation holds for 0 = no S dy — 1. Finally, 
ap(c, d, , d2) + apc, d,; — 1, d2) + asp, d; , d, — 1) 
= (c+ l)pe t+ 1, d, , d.). (42) 


Now, define 
dy dg 
PAQi ? Qo) = 2 22 p(m, ny » M2) Gi' Go’, (43) 
da 
Gin ns (Qo) = Dy p(m, ny » Nz) Qa’; (44) 
na=0 
and 
dy 
HmnslQs) = Ly pm, m , m2)Qi. (45) 


It follows from equations (40) through (42) that 


[a — a1 — q,) — a1 — @)IP.(m » G2) + ail — qi) qi Gea, (Qe) 
+ a1 — q2)q2°H «,a,(4:) = (¢ + WPesilt , GQ). (46) 
Equations (39) and (48) through (45) imply that 
dy da 


PAG ’ Qe) aa » a a(n, ) N2) Fn, +n(m)(1 = nv) “3 Qo)"”, (47) 


ni =0 n2=0 


Gals) = (I 3 lds , maoaen(( — a2", (48) 
and 
Hea) = (—Y)" YS alts dadonres(l — a)". (49) 


Using the identity gq, = 1 — (1 — q) and the relations (47) through 
(49) in (46) obtains 


dy 
a) 


n1=0 na= 


ds 


(OM y Ma) Far ens(C)L — ga)"(L — ga)” 


dit+l da 


— a SS D> a(n, — 1, M2)On ani) — gm) — qo)” 


ty =1 Tig=C 
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dy, datl 


— a, Ds .y a(n, » Ne — Lon, +n2-1€)(1 _ qi) _ Qo)" 


n1=0 na=1 
os dit+l da B d, 
= Glia DC) 
mi=1 n2=0 Ny == 1 


ra(d, , M2)oa,+n(€C) — mi)” — ge)” 


2a eS 9 ds | 
n 


ni=0 ng=1 
1 2 il 


aly ’ d2)on,+a,€)(1 = qm) r= Qo)" 


=C€+)  ¥ aly mouse + DO — a)" — a)". 


ny=0 na=0 
Equating the coefficients of (1 — qi)"*\(1 — q2)”’ yields 
[aonians(€) — © + Donan (e + Dlatm, , ne) 
= [aa(n, — 1, m2) + ara(n, , m2 — 1) + aea(my , m2 — Lon, +n,-1© 
dy 
nm —1 
ds 


N, — 1 


5 ee ea, 





fata , Nz) Fd, +na(C) 


+ ya) a(n, , d2)On,+a,(C)- (50) 





Using (33), we see that 
An +na(C) — (C + longa, (C +1) = —(ri + Ne)onangtile). (51) 
It is worthwhile to define 
B(m1 , M2) = (—1)" "am , Ne)on,+n, (6), and n= m+ 2. (52) 
Now, substitute (51) and (52) into (50) to obtain | 


. oO BQr , M2) = 48% — 1, m2) + a28(m M2 — 1) 


- a a Joa na 


Ny a, 1 
dz 
— Ay B(ny , do) (538) 
Ne = 1 
for 0 = n, S d; and n > O. (Both sides vanish for ny = ne = 0.) 
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One additional relation is required. Using (39), 


dy dg dy da 
» Ds p(m, Ny ? N2) = 2 > a(n, , Nz) Ons +na(M) 
n1=0 n2=0 n1=0 na=0 


ny 

ky 
= a(0, 0)oo(m). 

Consequently, noting relation (36), one obtains 


=> > > p(m, n, , Nz) = a(0, 0) py o(m) 


m=0 ni=0 n3g=0 


, = 








‘ ry 
ky 








a(0, 0)c,(c). 


Thus, 
igs 
are) = o,(c) 
and 
p00, 0) = 24 = Bd); (54) 


i.e., Bio,o) 18 the Erlang-B blocking probability (also known as the 
first Erlang loss-function) for the first-choice trunk group. 

Equations (53) and (54) completely determine B(m, nz) for 
0= nn; = d,. Using (52) and (39), we see that the state probabilities 
are given by 


p(m, Mm No) 


= ES corre nen) )ac ey GD. (65) 


ki=ni ka=ne 


Using the relation 








m) (n n} \m—n 
it is straightforward to show that if 
c dy da = 
Niwesaicnes = 4 z o hy pti, ky » ke) 
t=m ki=n1 ka=na m!| n:) n; 
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for0 =mZcand0 Sn, S d;, then an inverse relation is given by 


p(m, Ny 5 Ne) 


c. dy do ; 4 k k 
=) ly ee er ri 
i=m ki=n1 ke=na M) My) Ne 




















(56) 


‘Nitin 


forOSmScand0Sns d. 
Consequently, comparing (55) and (56) we see that 


B(n, , No) = Agendas 
N, 

















- 8) M Na t (57) 
c} ln) (ne 
Equations (14) and (55), imply that 
E{Ly} = apd: , 0), (58) 
and 
E{L2} = a,8(0, ds). (59) 
For the computation of B(n, n2) using (53), the ratio 
_ Fnsi() 
eee 7,(C) 


is required for n = 1. A recursive relation for the ratio is obtained 
from (32) via 





Onsi(C) _ [ +ia8| 4 Ii) | 





o,(C) n o,(c) ’ 
that is, 
afl c—a 
i (+) + . +1 for n21l. (60) 


The first-order (nonlinear) algorithm is initiated with 


1 _ aol) _ 
i eee oe 


1.e., the Erlang-B blocking probability for the first-choice trunk group. 


2036 THE BELL SYSTEM TECHNICAL JOURNAL, JULY-AUGUST 1971 


APPENDIX B . 
A Conditional Mean 


In this appendix, a formula is obtained for the computation off 


dy 
q (m,n) = > By, o(m, m , 7) 


= E{L,|M =m,N,=n}P{M =m,N,=n}. (62) 
From equations (9) through (12), it follows that forO0 =mSc-1 
and 0 =n S do, 
(a+ m+n -+ I)q(m, n) = ag(m — 1, n) 
+ (m + Lat + 1, 2) + (av + Val, n+ 1) (63) 
and for0 =n=d, — 1, 
(a2 +ce+n+ I1)q(e, n) = ag(e — 1, n) + aog(c, n — 1) 
+ (n + gle, n +1) + aBo,o(c, di, n). (64) 
Also, 
(¢ + dz + 1)q(e, dz) = ag(c — 1, dz) + azg(c, dz — 1) 
+ a,Bo.o(c, d: , dz). (65) 


Using the methods and results presented in Appendix A, we can 
show that 


A 


da 
qg(m, n) = > (—1)"“w, mail) for O<SmSc 


and OXSnXd., (66) 
where wo; = 0, and 
ds 


(n + 1) rn +101 = AgWn-1 — «| 
n—1 


Jo + a,B(d; , 7”) 


for OSnZd,. (67) 
In particular, g(c, dz) = @,, and can be obtained from (67) in the 
following closed form: 


+ Throughout this appendix, the subscript 1 on q: has been omitted for nota- 
tional simplicity. 


A 
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% Joao Hl es | 


qc, de) -_ Qe 7=0 k=7+1 


ee >it 7 | Tl as | 
Tea eee 





(68) 


k=j+1 ky, 


similar expression is valid for q2(c, d1). 
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Phase and Amplitude Variations in 
Multipath Fading of Microwave Signals 


By K. BULLINGTON 
(Manuscript received November 10, 1970) 


Experimental data on the duration of individual fades in a 4-GHz 
radio signal are used to derive the rates of change in amplitude and 
phase of a signal propagated through a 30-mile multipath medium. A 
by-product ts a relation between the rate of change in phase and the 
effective vertical velocity of the atmospheric trregularittes that can be 
considered to be the cause of the multipath phenomena. The derived 
distribution of velocities is then used to predict the variations in fade 
duration over a wide range of frequency and path length. 


I. INTRODUCTION 


Multipath fading on line-of-sight paths causes substantial variations 
in the amplitude and phase of the received signal. Theoretically, if the 
response to an impulse function were known with sufficient accuracy, 
all magnitudes and rates of change could be computed. In practice, 
this possibility is somewhat of an illusion because many different 
impulse responses occur on any given path, with significant diurnal 
and seasonal variations. Moreover, the desired accuracy is generally 
not achieved, because either the required impulse function needs to 
be only one cycle of the radio frequency or extensive correlation 
techniques are needed to achieve a comparable result. 

The quantitative estimates given below are based on elementary 
theory and on the measured distribution of fade durations. The results 
agree with (or at least do not contradict) the available experimental 
data. 


II. GENERAL CONSIDERATIONS 


The signal received through a multipath medium can be represented 
by the sum of the principal wave plus the delayed components (a,,e°“*"): 


2039 
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AM 

i A i A Atm 

E = Aew oi at ae w)At ) 
m=1 


= Ape erse! 


where a» and At,, represent the magnitude and relative time delay of 
the mth “echo.” 

The effect of frequency deviation, Aw/2z, is discussed in a later 
section on the variation in net loss within the modulation bandwidth. 
For the single-frequency case (Aw = OQ), it is convenient to define 
R, L, ¢, and y as follows: 


(1) 





i= Le pe eer 2S Le (2) 
_ _Kksing 
ey = 1—Rcos¢d 


where FR and 4, respectively, are the instantaneous amplitude and 
residual phase of the ‘‘composite echo” given by the summation term 
in (1) with Aw = 0. For the deep fades L K1,|1—R|SL,|¢| SL, 
and n is an even integer. 

It is shown in Appendix A that 


dL _ ds | Seo! 7 ak | 
di at | 927 (cos L) Bae ; (3) 
Ldy _ ds | z , i | 
dt ang dt (cos L) ee any p do ’ (4) 
and 
La = tan (« — 8) 
where 


ee ee Oe a ee _ sins) 
ae ( — ;) a (—* —R 


— -1 ___tany -] MN 
ran ( —TL/eosy] ~ 


tS Tg 


R do 


At the bottom of a fade dL/dt = 0 and L dy/dt has a maximum 
value of 





dg, 
dt (cosy — L)’ 


MULTIPATH FADING 2041 


conversely, dL/dt has a maximum value of 


dp fr oo. 
dane when dy = 0. 


Frequency selective fading requires that d¢/dt > dR/dt(g < 1), 
but dR/dt may not be entirely negligible at the very deep fade levels. 
This observation is consistent with physical models of “reflecting” 
layers that are slowly rising or falling with time. The physical model 
suggests that during any particular fade dé/dt and dR/dt can be 
constant, or at least they vary much more slowly with time than the 
“non-linear” parameters L, y, dL/dt, and dy/dt. This information is 
useful in examining the detailed wave interference characteristics 
during a few seconds or minutes in time. On the other hand, the 
process is nonstationary and the average fading characteristics taken 
over many fades depend on the long term average values of dé/dt and 
dk/dt, and hence on meteorological conditions. 

Since the variations in signal level are caused by variable metero- 
logical conditions along the path, experimental data are needed to 
obtain quantitative results. In principle, it should be possible to pre- 
dict the radio results from meteorological theory, if the meterological 
parameters were known with sufficient accuracy. In practice, this 
indirect approach is not feasible and direct radio measurements are 
required. Data on the number and durations of deep fades will be used 
to estimate the probability distribution of cos y and d¢/dé for use in 
(3) and (4). 


2 


Ill. NUMBER OF FADES VS DEPTH OF FADE 


Experimental data show that 20-dB fades occur about ten times 
more often than 40-dB fades and on the average last about ten times 
longer. In other words, both the number of fades and their average 
duration are proportional to L.1 Of all the fades reaching a given level 
L, , 80 percent will “bottom out” at or before 0.7L, , 50 percent will not 
go deeper than 0.52, , and only 10 percent will reach or go deeper than 
0.1L, . Thus the probability can be written as 

Tis 
min < = 
Prob (Pe s x] xX 
where X varies uniformly from 0 S$ X S 1. 

It can be seen from (2) that 

1—Ros¢_ Rksng 


es = 2 
L=(1—Rcos¢) V1 + tan’? y ay iy 
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and @ S L. At the bottom of a fade, ¢ — 0, dL = 0, y = 0, and 
Dinin ~ 1 anal Re . 

It follows that at any level L, < 0.1 the ratio 


Lanin aus ( 1 ome Ro cos 
L, \1—R, cos¢, Le 
~ecosy, for Rk, SR. (5) 


Thus the experimentally determined cumulative distribution for 
Lmin/L, also applies to cos y, when dR/dt K d¢/dt; that is 


Prob (cosy S$ X) = X 
and the average values are 


Luin 
L, 


The corresponding average value of sin y, needed in a later section, 
is given by 








1 
~ cosy = | X aX = §. 
0 


1 ) ane me 
| V1 — X'dx =". 
0 4 


IV. DURATION OF FADES 


The average duration of a 40-dB fade (ZL = 0.01) at 4 GHz on a 
30-mile path is about 3 or 4 seconds, and the average duration of a 
20-dB fade is about 10 times longer. The results of an extensive pro- 
pagation experiment have shown that the distribution of fade dura- 
tion 7 follows a log normal distribution with a standard deviation of 
about 10 log (716/750) = 5.6 or In (716/750) = 1.3. Quantitatively, these 
results are summarized as follows for 4 GHz on a 30-mile line-of-sight 
path: 


99 percent of the fades are shorter than 4000 Z seconds, 
70 percent of the fades are shorter than 400 L (average duration), 
50 percent of the fades are shorter than 200 ZL (median duration). 


These data are plotted on Fig. 1 and it is assumed that fade durations 
shorter than the median are given by the extension of this log normal 
distribution with the same standard deviation.* 

* Recent unpublished data on the duration of rapid fades indicate that the 


standard deviation may be somewhat larger than the “5.6 dB” used in this paper, 
which means that. the lines on Fig. 1 might be more nearly vertical than shown. 


MULTIPATH FADING 2043 














FADE DURATION IN SECONDS ~ 

















PERCENT OF FADES 


Fig. 1—Duration of fading at 4 GHz on a 30-mile path. 


The voltage-time diagram shown on Fig. 1 defines L, Lin, 6, and 
the fade duration 7 for a fixed frequency. From the diagram it can be 
seen that for dR/dt = 0 the fade duration 7 is given by 














9 sin? (Zi sin y,/R) 2 5 sin V1 
= de : saat R (6a) 
dt dt 
and since the average value 
siny, == 
t= 4 ) 
d¢ -_ rly 
dt 27R (6b) 
Substituting into (3) and (4), the average values are 
dL d(nL) = r° (72) 


Ldt~ at 87R’ 
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dy _ me Cee 2b) 5 
i = OER 5 ~ TR for L<1. (7b) 


It follows that, since 20 log L = 8.7 In L dB, 
dy _,2d(inZ) _ 2dB/s _ dBA 


. Wee Oa Cie 1a (8) 
Figure 2 shows the distribution of rate of change of phase dy/dt and 
the rate of change of Z in dB/second, based on (7a) and (7b) and 


on the distribution of fade durations shown on Fig. 1. 

















_ It will be noted that if y were uniformly distributed (sin y = cos 7) 
it would be expected that 
Tae 
dB/s = 8.7 di’ 
but this assumption is not consistent with the experimental result that 
the number of fades is proportional to L. 

In a similar manner, when d¢/dt = 0 ({| is a constant < Din) 
the fade duration might be represented by 


pee) Gass eR) 
dR os dR 1 0 
dt di 


or 





AVERAGE RADIANS PER SECOND 
AVERAGE DECIBELS PER SECOND 





PERCENT OF FADES 


Fig. 2—Rates of change in amplitude and phase during fading at 4 GHz on a 
30-mile path. 
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a _ at [m= a] 
dt +r [B= (9) 


There is no obvious way to relate a change in R to a change in mete- 
orological conditions and there is doubt that it is important to do so. 
The frequency selective nature of fading requires that d¢/dt be con- 
trolling most of the time. In addition, essentially all fast “flat” fading 
can be explained as a phase interference effect with a delay of one 
or three half-cycles as well as by a supposed change in R. Moreover, 
the experimental results that the number of fades and their average 
durations are both proportional to L are characteristic of a Rayleigh 
type distribution, which is usually explained on the basis of near- 
random phase. Consequently, it is assumed in the remainder of the 
paper that 

do aR 

di” dt" 


V. VERTICAL VELOCITY OF ATMOSPHERIC IRREGULARITIES 


Although dL/dt and dy/dt are important parameters in the design 
of communication systems, the search for a better physical under- 
standing of the fading mechanism needs to begin with the less com- 
plicated parameters R, ¢, and d}/dt of the composite echo given by the 
bracketed term in (1). It may be seen from (4) that for dR/dt = 0, 


dp Ldy 1 


dt dt (cosy — L) 
which on the average is 
dp 2L dy 
dt ~ dt 
In principle, the rate of change in phase d¢/dt might also be 
obtained by either: (2) direct measurement of phase, or (77) compu- 


tation from known or assumed values for vertical motion in the 
atmosphere. For example, it is reasonable to expect that 





(10) 





a Hi A (1) 


or conversely, 
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where d¢/dt is the rate of change in phase caused by an echo source 
moving vertically through a thickness (H, — Hy-1) at a velocity v. H, 
is the height of the nth Fresnel zone and H, — H,_, is the thickness of 
the nth Fresnel zone. The relation between d¢/dt and the effective 
velocity v is shown on Fig. 3 and the probability of exceeding specified 
values of d¢/dt has been obtained from (10) and from Fig. 2. It ap- 
pears that a vertical motion of only a few centimeters per second (a 
few feet per minute): is sufficient to explain the fade duration measure- 
ments and the values of dy/dt and dB/s derived therefrom. An al- 
ternative derivation of (11) is given in Appendix B. 


VI. FADE DURATION AT OTHER FREQUENCIES AND PATH LENGTHS 


The distribution of velocities obtained in the preceding paragraph is 
expected to be a meteorological phenomena independent of radio 
frequency. It should be possible to use this information to predict the 
fade duration at other frequencies and path lengths. 

Substituting (6b) into (11), and using equation (17), it follows 
that the | 


Average Fade Duration = 


PERCENT OF .FADES 
IN RADIANS PER SECOND 


1073 


dé 
dt 











0.1 1 10° 100 
FEET PER MINUTE 


01 OA 1 10 100 
CENTIMETERS PER SECOND 


Fig. 3—Change in phase at 4 GHz vs vertical velocity on a 30-mile path. 
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where D is the path length and \ is the wavelength. Equation (12) 
indicates that the fade durations vary as the ~/) D, if the effective 
value of n is essentially independent of frequency for the average fade. 
The preceding information and assumptions lead to the following 

estimates for the average fade duration at various frequencies, when 
the path length D is measured in km. 


Average I’ade Duration 


Frequency in Seconds 
D 
160 MHz 2000 L 2 
50 
D 
4 GHz 400 L ne 
D 
6 GHz 320 L <2 
6 X 10° GHz ‘ 2 
(visible light) 50 


The experimental results at 4 and 6 GHz are not accurate enough to 
establish the A”? variation over this narrow range, but at lower fre- 
quencies the fades, when they occur, last longer than at 4 and 6 GHz 
so the fading rate is not independent of X. 

An extrapolation of the radio data to optical frequencies indicates 
that the average fading rate on a 5-km path through the atmosphere 
would be of the order of one or two cycles per second with much more 
rapid fluctuations for a small percentage of the time. In principle, 
scintillations are rapid multipath phenomena. Although differences, 
such as the effect of water vapor, antenna beamwidth, etc., need to 
be included, the fading rate must vary with the wavelength as ?, 
where 7: is a positive fraction less than one. A variation of as much 
as At is unrealistic because it would predict that the twinkling of the 
stars would ordinarily be above the flicker rate acceptable by the 
human eye. A consideration of both radio and optical phenomena 
requires a variation in fade duration with frequency close to 1/7 (at 
least in the range between A1/° and A?/*), so the elementary result 
shown in (12) is a reasonable working hypothesis. At optical fre- 
quencies the Fresnel zones are more nearly comparable in size to the 
atmospheric irregularities and both horizontal and vertical motion of 
atmospheric irregularities produce significant scintillations. 
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VII. VARIATIONS IN NET LOSS WITH FREQUENCY 


The principal interest thus far has been in the characteristics of 
fading at a single fixed frequency. Even if an automatic equalizer 
could be built to compensate perfectly for the effects of fading at any 
reference frequency, the resulting equalization would be less than per- 
fect at frequencies other than the reference frequency.?* This type of 
distortion results in variations in net loss (deviations from a constant 
output level) over the modulation bandwidth. By the use of the con- 
cept of Fresnel zones and some simplifying assumptions, it is shown in 
Appendix C that the variations in net loss relative to a fixed level 
at the carrier or pilot frequency is given approximately by 








20 log [1 — i Put . (13) 
ert : 
10 log {1 + Tr) )» (14) 


where 


A 
Oett = ae . 


f 

20 log L is the depth of fade in decibels (LZ K 1) and Meee is the effec- 
tive number of half-wavelengths in the relative delay of the strongest 
delayed component (or composite “echo’’) that is causing the fade 
at the reference frequency. The corresponding variations in net loss 
are shown in Table I and on Fig. 4. Table I indicates that most of 
the deep fades are caused by “echo” delays of 1, 3, or 5 half-wave- 
lengths (nerr = 5), because experience has shown that good diversity 
requires a frequency separation greater than 1 percent and that accept- 
able transmission quality can almost always be obtained with band- 
widths of 0.1 pereent or less. The corresponding values of Af/f for a 
20-dB fade are ten times larger than shown in Table I. 


A comparison between theory and experiment is shown on Fig. 4 
for the case of 40-dB fades. This figure also suggests that the median 
40-dB fade corresponds to 1 S ne S 5, but that 10 percent of the 
40-dB fades correspond to delays of many half-wavelengths. The curve 
for n = 100 represents the approximate upper boundary set by 40-dB 
antenna patterns at 4 GHz on a 30-mile path. The corresponding 
theoretical curves for a 20-dB fade would be shifted one decade to the 
right. The crossover of the experimental 40-dB fade data and the 
nm = 1 curve shown on the figure is not possible with a single echo, 
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TABLE I—VARIATIONS IN Net Loss 








Aj for 40-dB fade 

















f 
Car Net Loss —_——_———,—_—. 
T Variation ; Nett = 1 | mere = 3 | nese = 5 
(dB) Quality (%) (%) (%) 
0.1 0.04 |Excellent transmission 0.03 0.01 0.006 
0.3 0.4 |Acceptable transmission 0.1 0.03 0.02 
1.0 3.0 Poor transmission 0.3 0.1 0.06 
3.0 10.0 = _|Fair diversity 1.0 0.3 0.2 
10.0 20.0  |Good diversity 3.0 1.0 0.6 





and is an indication that more than two components are present much 
of the time. The ‘experimental 40-dB fade’ frequently represents 
the combination of two or more echoes whose effects add. to 40 dB; 
in this case, the limiting theoretical curve for n = 1 is shifted to the 
right because the principal component of the composite echo, taken 
by itself, represents a fade of less than 40 dB. 

The agreement between theory and experiment is considered satis- 
factory and as evidence that most of the fades are caused by strong 
components delayed by only a few half-wavelengths. It has been 
suggested that a better fit with the shape of the experimental data 
exceeded 10 percent and 50 percent of the time can be obtained if the 
1 = —1 in (13) could be omitted; such an assumption is rejected 


DECIBELS 








ome ome —— THEORY 


MEASURED DATA FROM 
KAYLOR, B.S.T.J. SEPT. 1953 


TIME DELAY =F X 1079 SECONDS 
nN+1=NO.OF FRESNEL ZONES 





Fig. 4—Variations in net loss during 40-dB fades at 4 GHz on a 30-mile path. 
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because it seems contrary to logic and because it would create dif- 
ficulties in explaining the data exceeded by, say, 90 percent of the fades. 


VIII. CONCLUSION 


Experimental data on the number and duration of fading have been 
used to estimate the rate of change in phase and the rate of change in 
amplitude of an electromagnetic wave received through a multipath 
medium. The elementary theory used herein achieves quantitative 
results, primarily because it explicitly includes the first-order effects of 
phase associated with the use of Fresnel zones. Alternative methods 
starting from the more fundamental wave equations are more sophis- 
ticated and may be more exact in a mathematical sense, but their 
inherent generality does not ordinarily lead to results that can be 
compared with experiment, unless somewhat arbitrary “constants” or 
correlation functions are assumed. 


APPENDIX A 


The relative magnitude and phase of a multipath signal can, by 
definition, be represented as 
(iS Va ee es Les 


where n is an even integer and R# and ¢ are, respectively, the instan- 
taneous magnitude and residual phase of the sum of all the delayed 
components. It follows that 


VW( — R cos 6)” + (Rsin ¢)” 





I 


L 


ll 
(— 
| 
x 
° 
jo) 
72) 
& 
= 
+ 
ct 
© 
P. 

a 





VL = ie + 2R(1 — cos $) 


1—Reos¢d_ Ksng 
cos Y sin y 





By differentiation, 


dL 





_ Rsin¢ dd — (cos¢d — R) dk 
re L 


(R cos — R? + 1 — 2R cos¢ — 1+ 2R cos¢) dR 
R 





sin y dé — 


et 


Say i (1 =u Et) aie 


\ ii 7 au 
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= sin y dg — (cosy — LH. 


In a similar manner, 
Ldy = (cosy — L) do + sin S. 


APPENDIX B 


As shown in several textbooks, the height of the nth Fresnel zone at 
the middle of path length D is given by 


VA Dn (15) 
2 
The corresponding phase delay is 6, = nz, by definition of Fresnel 
zones. - 
In order to restrict the parameter n to integral values, we can gen- 
eralize as follows: ~ 


A, = Yn = 


do _ _ds 
dt dt 


1 [A D@ +4) 
CRN ge 


_dW_1 [\Dd¢ 
dp ae N Gb cd (16) 


where v is the effective velocity in a plane perpendicular to the direc- 
tion of propagation. 
From (15) we have 


ae ae (1 = o—) 


wi pe 
IN for n> 38. (17) 


By substituting (17) into (16), we have 


6=nr — ¢; 


and 


It follows that 


v 


de - Tv 
di H, — Hy.’ 
which is the same as equation (11). . 
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APPENDIX C 


As noted in (1) the signal received through a multipath medium 
can be represented by the sum of the principal wave plus the delayed 
components (a,,e°*'"): 


M 
E = Aetorsor(y ae > a,efirseerarn (18) 
m=1 


where Gm and At,, represent the magnitude and relative time delay of 
the mth “echo.” The term in brackets can also be resolved into Fresnel 
zones so that 


N 
ri Dae err (19) 
n=1 


where n is an integer and a, is the magnitude of the signal in the 
(n + 1)th Fresnel zone. The upper limit N is the maximum number of 
Fresnel zones enclosed by the antenna beamwidth; components with 
longer delays, if present, are relatively unimportant because their 
magnitudes are reduced by the off-beam antenna patterns. The ex- 
pression for v can be separated into the following three terms: 


N N 
v= 1+ Dae + > ae*7e — 1) (20) 
n=1 n=1 


where § = (Af/f)naz. The first two terms in (20) show the fading 
characteristic at frequency f, which in principle can be corrected by 
a perfect automatic equalizer based on a carrier frequency or pilot 
tone. The third term on the right in (20) shows the relative variation 
(distortion) at frequency Af. 

By definition, let the first two terms be 


N 
1+ >> ae" = Le‘. (21) 
n=1 


Substituting into (20) and introducing an effective value of 8, we have 


76 
= Le? + Lett > ane (° 1) 


10ers 


3 
| 





I 


N N is 
Le *? + iBsn| Le —1— Dae’ + > ae™(2 ty] 
n=1 n=1 


Vert 


N tn 
= Le ** = ibn = de: = +3 ane™*(2 : ! = :)|. (22) 
n=1 1ert 
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By this series approximation and a suitable choice for the effective 
value of 8 (Serr), the last term on the right can be made negligible 
during fading conditions (ZL « 1),* so that 


< iy 
vey Le (1 — iduut"”), (23) 
A “perfect” equalizer (at any given fixed frequency) would leave a 
residual distortion of 


1Sorce\” 
a 
The phase angle y is zero at the bottom of the fade, is +45 degrees 
at 3 dB above the bottom, and varies over the range — 7/2 < y < 7/2. 
In the region of good transmission (8err/Z &K 1) it is sufficiently ac- 
curate to approximate the net loss variations as 


= 10 log E + (2)"]. (25) 


While both Serr and Z are individually less than unity (and usually 
<0.1) their ratio can be greater than unity, and in that case the net 
loss variations calculated from (25) need to be interpreted as the 
median value of a distribution of net loss variations. 


, 


v=l—- (24) 


Lert 


20 log v’ = 20 log | 1 — 
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The Computation of Error Probability 


for Digital Transmission 


By F. S. HILL, JR. 
(Manuscript received October 19, 1970) 


A method is presented for calculating the probability of error for 
a digital signal contaminated by intersymbol interference and additive 
Gaussian noise. The method constructs a close approximation to the 
probability density function of the intersymbol interference, circum- 
venting the heretofore formidable computational problems by decom- 
posing the calculation into a sequence of simple calculations. This 
method is applicable to binary transmissions, as well as 4-, 8-, 16-,... 
level transmassion. The method is rapid enough for use on time-sharing 
facilities. As part of the method, a new and rather simple scheme 1s 
presented for including the effects of partial response source coding. 
Several interesting examples which use and give insight into the 
method are included. 


I. INTRODUCTION 


In a large class of digital transmission systems, a succession of 
amplitude modulated pulses is sent over a channel to the receiver. The 
signal suffers two main types of distortion: additive noise and inter- 
symbol interference (ISI). The latter arises from dispersion in the 
channel, and is a noise-like process due to the overlapping of many 
neighboring pulses.1 The number of these neighboring pulses, or “in- 
terferers,” depends on the system impulse response, and can be rather 
large for systems with restricted bandwidth. 

To properly analyze a system one must compute its probability of 
error due to the noise and intersymbol interference. The effect of the 
additive noise is simple to analyze, but that of the ISI is extremely 
difficult to find due to the complicated nature of its probability density 
function (pdf). The complexity of the pdf grows exponentially with 
the number of interferers, and for more than about 12 interferers the 
computation of error probability has long seemed impossible. 
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Until recently researchers have analyzed such systems by one of 
three main routes: (7) they have used more tractable—although less 
meaningful—performance measures such as peak distortion? or mean 
squared error,?* (7) they have found upper bounds on the probability 
of error,* or (72) they have found the probability of error using a 
truncated impulse response, for which only a few interferers were 
assumed to be significant.®* Unfortunately, in many restricted-band- 
width systems as many as 50 or 60 interferers can be important, so the 
use of a truncated impulse response is inadequate for a large class of 
systems. The goal is therefore to find methods which will treat large 
numbers of interferers while avoiding the exponential growth of com- 
putational complexity. This paper describes such a method. 

Recently two other schemes have been presented which also provide 
accurate estimates of the error probability for large numbers of inter- 
ferers.7® These methods do not provide the pdf of ISI, but calculate 
instead the terms in a series expansion of the expression for the error 
probability. They were devised primarily for the binary transmission 
case, and the extension to multilevel transmission would be rather 
awkward. 

The new technique presented here constructs a close approximation 
to the pdf of intersymbol interference, and uses it to find the error 
probability. Knowledge of the pdf can provide useful information 
about the intersymbol interference process, illuminating for the sys- 
tem designer the relationship between certain types of channel charac- 
teristics and error performance. The technique is applicable to binary 
transmission, as well as to 4-, 8-, 16-, --: level transmission. The 
method also allows simple inclusion of certain kinds of source coding 
such as partial response.® The ability to treat the multilevel case and 
different kinds of source coding is very important. In feasibility studies 
a system designer might know the general type of channel charac- 
teristics the signal will encounter, but he does not know how sensitive 
the signal will be to various combinations of numbers of source levels 
and source coding. The method given here is envisioned as providing 
a valuable tool in such studies. 

In Section II the probability of error is related to a single random 
variable, the error voltage z, which then characterizes system per- 
formance. It is this random variable whose pdf is sought. Examples 
taken from the applicable class of source coders are presented and it 
is shown that the effects of coding can be transferred from the source 
statistics to the channel description. Alternative performance measures 
are noted for later use. In Section III the details of the method are 
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described with the computer programmer in mind, and various em- 
pirical rules-of-thumb are suggested. In Section IV several simple 
examples are presented which illuminate some of the potential uses of 
the method. In Section V the accuracy of the method is considered, 
and guidelines for choosing some parameter values in the method are 
given. The method presented here is an extended and refined version 
of a technique developed by W. E. Norris.?° 


II. ANALYSIS 


Figure 1 indicates the system under consideration. The source emits 
a symbol every T' seconds. The symbols are statistically independent, 
and form the sequence {a,} where each a; takes on one of the N 
equally likely values m(7) 


m(i) = (2 -N—1)V/(N—-1), 1=1,2,---,N. (1) 


The outermost levels are +V volts. The sequence is then passed 
through a coder to form the sequence {c,}. Only two kinds of coding 
are treated in the main body of the paper, although the method is 
extended to more general coders in the Appendix. The two special 
types treated here are the uncoded case and the class IV partial re- 
sponse case,® where 


uncoded: c, = 4; , 


partial response IV: c, = a, — @,-2. (2) 


Partial response IV coding provides useful shaping of the signal spec- 
trum, and precoding of the source sequence can be used to prevent 
error propagation.® It is assumed that the sequence {a;,} has already 
been precoded appropriately. 

The sequence {c;,} modulates the impulse response r(t) of the chan- 
nel. White Gaussian noise is added, and the received signal y(t) is 
sampled every 7' seconds. The decision device (typically an A/D con- 
verter), determines which of the possible transmitted levels each 


|RECEIVER 
| 
“ | 
IMPULSE RESPONSE —~ NOISE | 
r(t) | 


Fig. 1—System configuration. 
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sample is nearest, and outputs the sequence {d;}. (The receiver can 
base its decision simply on the present sample because of the pre- 
coding.®) Ideally, each d;, = c; , but because of the noise and ISI they 
will occasionally differ, and an error is made. The method discussed 
here calculates the probability of occurence of such an event. 

The sample at t = 0 has the form 


eo 


y(0) = 2 ew(—kT) +n, (3) 


where n is a realization of a zero mean Gaussian random variable 
with variance o?. Because the sequence {a,} and the noise are sta- 
tionary in a statistical sense, studying the error probability for the 
single sample y(0) is equivalent to studying it for the entire ensemble 
of samples y(mT). 

Ideally y(0) will equal co. The error voltage z is therefore given 


by 


x 
II 


> Cyt, +N, (4) 


k=-0@ 


where 


x, = r(—kT) — 6, (5)* 
are the “error samples” of the system. It is very convenient to in- 
corporate the effects of any source coding in the error samples. In the 
case of partial response coding: 


ie] 


25 Dy, (a, = Oy-2) Xr, +n 
k=-0 
= DF ae +n, (6) 
k=—-0 
where 
Cy = UE — UX K+2) (7) 


are the ‘coded error samples.” Thus z of (6) depends on the simple 
statistics of the uncoded source symbols a; . (If no coding is used, then 
Cy = Ly.) 

The error voltage z is the sum of a large number of independent ran- 
dom variables. Each a, has the same discrete probability density 
function: A set of N equidistant Dirac delta-functions with weights 


* §, = Kronecker delta function: 6 = 1; & =0,k <0. 
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1/N. Each a, is sealed by the coded error sample e; . The n is Gaussian, 
and assumed independent of the a,’s. The pdf p,(-) of z is found by 
convolving the individual density functions of the constituent random 
variables. Because each of these constituents has a symmetrical pdf, 
about z = 0, p,(-) is also symmetrical. Once p,(-) has been found, the 
error probability for the sample y(0) is simply the probability that 
y(O) is further from cy than from some other level. When co is an inner 
level this is just the probability that |z| > V/(N — 1). When c is an 
outer level only one polarity of z causes an error, and an error occurs 
with the probability that z > V/(N — 1). For uncoded symbols outer 
levels occur with probability 1/N, while for partial response IV coding 
they occur with probability 1/N?. Hence the error probability is 


=21-N) [ a 5 pale) de, ) 


where h = 1 for an uncoded source, and Ah = 2 for a partial response 
IV coded source. 

Clearly, the real task in finding P, lies in computing p,(-), since so 
many random variables are involved. The method used to achieve this 
is described in the next section, after two simpler measures of system 
distortion have been introduced. These measures have been widely used 
in the past, and will provide useful definitions in the discussion to 
follow. They are both defined in the noise-free case (i.e., n = 0) here 
in order to isolate the effect of the intersymbol sri enterente: 


2.1 Mean Squared Error’"* 
The variance of the error voltage z is given by 


Ely) — el” = ED ae] = Ela’) 2 Ck; (9) 


where H[a?] is the variance of the source. A simple calculation based 
on equation (1) shows that H[a?] = 4V?(N + 1)/N — 1). A nor- 
malized version of the mean squared-error then depends only on the 
coded error samples, and will be called MSE: 


MSE = >> é&. (10) 
k 
2.2 Peak Distortion” 
The maximum value that z of equation (6) can have when n = 0 is 


maxz= V >> le |, (11) 
ay k 
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which occurs when all of the a; symbols have maximum amplitude V 
and such signs that the interfering symbols add constructively. Such 
a sequence occurs with very low probability. Peak distortion here is 
defined:as D: . 


D= dla. (12) 


III. THE COMPUTATIONAL METHOD 


The starting point for computing p,(-) is the set of coded error 
samples {e,}. The only other relevant parameters are V, N and o?. 
Because each random variable a, has the same pdf, and this function 
is symmetrical, one need only consider the magnitudes |e;,|, and these 
can be relabelled to rank order the samples for convenience. 


3.1 Obtaining the Error Samples 

Channel characteristics are typically measured or calculated in the 
frequency domain, although in some simple cases an analytical expres- 
sion for the channel impulse response is available. If the impulse 
response is indeed available, it is simply sampled as in equation (5) 
to obtain the error samples. Otherwise the error samples are com- 
puted from R(f) as follows.* Because by inspection 


e) 1/27 oe) 
ner) = [| Roe ap= foe YO RG-m/Taf 3) 
. 0 -1/2 m=—o 

the Fourier coefficients of R(f) = >°,, Rf — m/T) are desired. A 
Discrete Fourier Transform (DFT)—perhaps using a Fast Fourier 
Transform algorithm—of N, samples of R(f)/N,T, f; = (2 — 1)/N,T, 
a= 1, 2,--- ,N,, yields N, samples of the aliased version f(t) = 
Yin rE — mT), = (k — 1)T, & = 1, 2, +--+, N, .*' Although the 
samples of #(¢) are not identical to those of r(é), for well-behaved channels 
and sufficiently large N, the two are indistinguishable. 

The error samples of equation (5) are formed by removing unity 
from 7, . The coded error samples (for the partial response IV case) 
are then formed by subtracting from each sample the value of the 
error sample two places to the right in accordance with (7). The last 
two error samples, éy,-, and ey, are formed making use of the periodic 
nature of r(é):"' e.g., because ty,+2 = ®_ we have ey, = ty, — Xo. 
Finally, only the magnitudes are retained, and these are rank ordered. 


* Upper and lower case letters indicate Fourier Transform pairs. 
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The result is the vector e: 
€ = (1,@,°°',ev,),: 2 2°°' Bey, 20. (14) 
(Any e, = 0 could be discarded.) 


3.2 Formation of the Pdf for Binary Transmission 


The number of levels N is normally a power of two. In such cases 
the pdf p.(-) for N levels can be built up from the pdf for the binary 
case as discussed below. Thus the binary case is of fundamental im- 
portance. Ignoring the Gaussian noise for the moment, the error voltage 
z of equation (6) is 


Ne 
,.= 6 Arr (15) 
k=1 


where the e¢, are elerhents of e and for the binary case each a, takes 
on the values -EV with equal probability. There are 2”" possible realiza- 
tions of the. sequence (a; , @2 , -*- , @y,), each occuring with probability 
2-"*. Rach realization yields a value for z, (not necessarily all different), 
which contributes a delta function of weight 2-”* to the pdf of z. Since 
N, can be 100 or more, an attempt to form z for each possible sequence 
of a, would be futile (2'°° = 10°°). Instead, decomposition and quan- 
tization are used to reduce this formidable task to a sequence of rather 
simple ones. 


3.2.1 Decomposition 

The solution used here to circumvent the overwhelming computa- 
tional problem is to decompose the NV, samples into blocks of K samples, 
and to find the pdf of ISI due to the samples for each block.* The 
resulting pdf’s are convolved together to form the final binary pdf 
of z. Using specific numbers for concreteness, if N, = 54 and one chooses 


Kk = 9, then z can be written as a sum of six random variables 
A,,-°+:,Ag, where 
9i 
A;= >> ae, t=1,--:,6. (16) 
k=91-8 


Kach A, is a discrete random variable, being the sum of nine discrete 
random variables. As the pdf of each A; is formed, it is convolved 
with previous ones until all 54 samples have been included. 


* Assume for the moment that N, is an integer multiple of K. A rule-of-thumb 
for discarding some of the small error samples in order to reduce the computation 
time is described in the following text. 
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Consider the formation of A, . Since a, can take on values +V, 
there are 2° possible values for A, . The largest value, denoted as A*, 
occurs when all a, = V, and has value A* = V(e, + e. + --- + @9). 
The 2° delta functions that make up the exact pdf of A, will therefore 
lie in the interval [— A*, A*]. Since the pdf is symmetrical it is sufficient 
to compute it for nonnegative values of A, only. Every combination 
of --V values for a, , --- , d is tried. For each the corresponding value 
of A, is found according to equation (16), and is discarded if A, < 0. 

The following simple technique insures that all combinations are 
included. A 9-digit binary number B is initialized to zero, and then 
in turn incremented by one until all digits are one (2° iterations in all). 
For each value of B, the value of a, is V if the kth digit of B is 1, and 
—V otherwise. 


3.2.2 Quantization 


Decomposition alone would not significantly reduce the size of the 
computation, since the total number of delta functions in the final 
binary pdf of z would still be on the order of 2°*. Quantization of the 
voltage axis is used to keep the number of possible levels of 2 within 
reasonable bounds.’” 

The interval [0, A*] is marked off with 2N, + 1 “location” points 
t;,7 = 0,1, --- , 2N, . Associated with each ¢; is a probability p; . 
Instead of recording the exact value of A, , it is tested to see which ¢; 
it is nearest, and the corresponding p; is incremented by 27°. There 
are many roughly equivalent ways of assigning the values t; . The 
following scheme was adopted for its computational simplicity. The 
interval [0, A%] is broken into two parts at some level kA*, and then 
each part is uniformly quantized into N, levels. Hence, the ¢; are 
given by 


a ena ai ee 
For k = 1/2, the quantizing is therefore uniform over the whole 


interval. For k > 1/2 the quantizing is finer for larger values of A; , 
so that the most important levels of error voltage are most accurately 
represented. An efficient choice for k is 0.6, as shown in Section V. 
(However, uniform quantizing may have other advantages since con- 
volution could be performed using a Fast Fourier Transform 
algorithm.! 

The pdf’s of Az, --: , Ag are computed in the same way. After 
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each has been found, it is convolved with the resultant pdf for the 
previous ones. Quantization is also used in the convolution process. 


3.3 Formation of the N-Level Pdf 

Once the binary pdf for z has been obtained, it is a straightforward 
matter to find the pdf for 4-, 8-, 16-, ++ level sources. The key to this 
is to envision N level transmission as the sum of logs N independent 
binary transmission systems.’° For instance, an 8-level source with 
voltage range from —V to V can be decomposed into three binary 
sources with symbols f;, g,, and h;, having only values +V. Each 
8-level symbol a; can be written as: 


_ Af, + 29, +h 
= 7 


and the error voltage for eight levels can be written in the noise-free 
case as 





(18) 


x 
ll 


4D fie + # OS ge. +4 DS hier 
=7E, +42, + 48, (19) 


where #,, Ey, and £3 are independent random variables, all having 
the same pdf as z in the binary case. (The generalization to any power 
of 2 is straightforward.) Since z is the sum of three random variables, 
and the pdf of each is known, it is a simple matter to convolve scaled 
versions of the binary pdf to obtain the 8-level pdf. Quantization is 
used here also, employing the scheme described above. The final pdf is 
thus approximated over the interval [0, D-V] by a set of 2N; + 1 
delta-functions having locations ¢; similar to those In equation (17), 
and probabilities p;. By symmetry there is the same pattern of delta- 
functions at location —¢; with probabilities p; . 


3.4 Calculation of the Error Probability 


The Gaussian random variable n of equation (6) is reinserted by 
convolving its pdf with that found above. The result is simply 
1 ys Ween oe ot aa ae (20) 
Vo mo 


From equation (8) the error probability is then 


= 20 - Wy Sp. [o(a+)+0(Ht4)| en 


P,() = 
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where Q(x) is the normal probability integral 


Q(2) = Ti [ Ye? at, (22) 


The accuracy of the method is examined in Section V, following a 
set of examples of the method applied to typical channel distortions. 


3.5 Reducing the Computation Time by Discarding Samples 

In the preceding discussion all N error samples of equation (14) 
were involved in the computation of the pdf of ISI. In most practical 
cases, however, a sizable number of the error samples are very small, 
and consequently their effect on the nature of the pdf as calculated 
is negligible. In order to save some computation time it behooves one 
to truncate the rank-ordered vector e of (14) at some index M, and to 
lump the other samples together in some fashion. This is an optional 
procedure, since the computation time grows only approximately 
linearly with the number of blocks of K samples processed, but it can 
indeed improve the efficiency of the method. 

One way to approximate the effect of the remaining samples is to 
treat the random variable 


Ns 


A, = > Er, (23) 


K=M+1 


as a Gaussian random variable with variance‘ 
Na 
2 2: 2 
o = Ela} >) é&. (24) 
K=M+1 


The effect of these samples is thus a contribution to the Gaussian 
noise n, replacing the previous variance o” in equation (21) with o? = 
o +o. 

The question of choosing M still remains. One rule of thumb is that 
the sum of the samples lumped together should not exceed 1 percent of 
the total sum D. This was found to be very useful empirically: for all 
choices of channel studied P, was the same whether all N, samples 
were used, or only M of them, while if only M were used the saving 
in computation time was significant. 

There are some pathological cases of mainly academic interest where 
D is theoretically infinite. This is true, for instance, for an ideal band- 
limited signal with a timing error. Saltzberg has shown, using an 
upper bound method, that the error probability can still be small in 
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such cases. The present method will not work, however, since the 
value of D is a crucial landmark in the computational procedure, and 
it must be finite. Such a restriction does not significantly reduce the 
power of the method. 

IV. EXAMPLES OF ERROR RATE CALCULATION 


In each of the examples the following definitions are used: 


Error Rate = P, 








log. N ’ (25) 
_ Ble") _ h VN + 1) 
pa ~ 80° (N — 1) 


P, is normalized by the number of information bits in each symbol 
so that comparisons between different values of N will be more mean- 
ingful. Signal-to-noise ratio is given by the ratio of the variance 
E[c?] of transmitted levels to average noise power. Since twice as 
much power is transmitted (if V is unchanged) when partial response 
IV coding is used, h, = 1 for uncoded sources, and h, = 2 for partial 
response IV sources. 


(4) Error in Sampling Time: 
A convenient example of an ideal signal shaping characteristic 
S(f) is shown in Fig. 2. S(f) satisfies the Nyquist criterion 
and consequently if R(f) of Fig. 1 were equal to S(f) no inter- 
symbol interference would be present at the receiver. Instead 
it is assumed that the sampling time is in error by p7' seconds, 
so that R(f) is given by 


R(f) = S(fye?7?". (26) 


Figure 3 shows the vector of error samples e of equation (14) 







t 
t S(t)=sinctyt/T) sinc(t/T) 
~ Y=O0.4 
hg) 
i?) 

Pa La 

2T eT 


FREQUENCY, f = 


Fig. 2—Ideal signal shaping characteristic. 
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Fig. 3—Error samples e, for timing error. 


for p = 0.02 and p = 0.1, and for coded and uncoded sources. 
The samples fall off rapidly in each case, and the significant 
samples are somewhat larger for partial response IV coding. 
In order to include 99 percent of D, six blocks of nine samples 
must be used in the partial response IV case, and eight blocks 
of nine in the uncoded case. Figure 4 shows the resulting prob- 
ability density functions for an uncoded source with timing 
error p = 0.1. (In the quantization process N, = 40 so that 
81 locations were used in the interval [0, V-D].) The binary and 
8-level pdf’s are shown along with a Gaussian pdf having the 
same value of MSE. The Gaussian function is a reasonably 
good approximation to the binary case near the origin, but 
is of course very poor near zg = D-V, since the pdf’s of inter- 
symbol interference are strictly zero beyond this point. Figures 5 
and 6 show curves of error-rate versus SNR for several values 
of timing error. An 8-level source is of course much more sen- 
sitive to timing error than is a binary source. In the absence 
of intersymbol interference a partial response IV signal requires 


(7) 
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about 3 dB more signal-to-noise ratio than an uncoded signal 
to achieve the same error-rate, under a constraint of equal 
average signal power. It is clear from the figures that this 
“SNR cost”? need not be maintained when intersymbol inter- 
ference is present. The partial response IV signal is more sen- 
sitive to timing error than is the uncoded signal, such that at 
p = 0.15 the cost is around 9 dB. However, for types of dis- 
tortion other than timing error, this situation can be markedly 
reversed, since a partial response IV signal is rather impervious 
to distortions near de or the Nyquist frequency.’ 


Single Echo in the Channel: 


Echoes frequently occur in transmission channels, generated for 
instance by mismatched terminations. A sizable amount of 
analysis has been done on echo theory in relation to system 
distortion.’"’” Here a single echo of amplitude g and relative 
delay 87 is assumed to occur in the ideal channel of Fig. 2, 
so that 


R(f) = S(f)(L + ge?7"*"). (27) 


i072 






UNCODED SOURCE 
BINARY 





TIMING ERROR, p = 0.1 


D=0.457 
MSE = 0.0255 
40! 190 
| | — 1 
0.6 D-V D-V—Z 


Fig. 4—PDF of intersymbol interference. 
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Fig. 5—Error rate vs SNR for timing error. 


Because s(¢) is symmetrical and introduces no intersymbol 
interference, an echo at 87 is identical in its effect to an echo at 
(+ 6 + n)T for any integer n. For instance, echoes at 8 = 
+ 0.4, + 3.4 + 0.6 all have identical error sample vectors. 
Consequently one need only consider values of 6 between 0 
and 0.5. 

Figure 7 shows curves of error rate versus SNR for echoes 
of various strengths, and for an 8-level partial response IV 
coded source. Direct calculation shows that, because of the 
narrow excess bandwidth y of S(f) in Fig. 2, the sample error 
variance MSE of equation (10) is essentially independent of 
echo position. Echo strengths were chosen to yield convenient 
values of MSE = 0, 0.001, 0.008, and 0.005. It is clear that 
echo location has a strong effect on the error rate curves. Echoes 
at 8 = .5 yield the largest error rates. The error rate does not 
necessarily vanish as SNR — o, as indicated by the asymptote 
labelled “‘isi alone.” 


ERROR RATE 
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(22t) Equal Error Samples: 


26 






Consider the error sample vector e consisting of n identical 
elements, ¢, = u, 7 = 1, --- , n. A considerable amount of 
insight is gained by examining this case, although it is unlikely 
that any physical system would give rise to such a vector of 
error samples. In addition, one can derive the pdf analytically 
for the binary case providing a useful check on the computer 
method. 

The error voltage z of aiation (15) clearly has the value 
uV(n — 2r) if and only if r of the symbols a, equal —V, and 
the rest equal V. Such an event occurs with probability (7) 2” 
so that z has the pdf 


p.(e) = 2 e) 2” é(fe — uV(n — 2r)), (28) 


r=0 


and the error probability follows immediately using the method 


—-— UNCODED 8-LEVEL 
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Fig. 6—Error rate vs SNR for timing error. 
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Fig. 7—Error rate vs SNR for single echo. 
of Section 3.4: 


P= (") 2"Q[-V SNR [1 — u(n — 2n)]]. (29) 
r=0 
The case n = 1 is the same as an echo positioned at BT = 0. 
Figure 8 shows curves of error rate versus SNR as a function 
of n. (Curves formed using equation (29) and the method just 
given were indistinguishable.) The samples have size u = 
0.5/n so that d = 0.5 for all n, while MSE = 0.25/n. From 
the curves it is clear that a few large error samples are more 
degrading than many small ones for the same value of D. This 
is intuitively reasonable since only very special source sequences 
{a,} will cause the error voltage z to be large when e consists 
of many small error samples, and such sequences occur with 
very small probability. Figure 9 shows similar curves, where 
now the samples have sizes 0.1/+/n or 0.2/-/n, so that MSE 
is constant at 0.01 or 0.04. For MSE = 0.01 error rate is some- 
what insensitive to the number of error samples, because the 
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increasing value of D = 0.1 Vn with nis offset by the decreasing 
probability that sequences {a,} yielding large errors will occur. 
However, as 7 increases for MSE = 0.04, D approaches 1, the 
value. at which errors occur due to intersymbol ‘iaterfareniee 
alone, (D = 1 for n = 25). The offset in probability is not 
strong enough here, and error rate is very sensitive to n. 


Vv. ACCURACY OF THE METHOD AND THE SALTZBERG BOUND 


The applications of quantization at various steps in the computa- 
tion shift the locations of some of the delta-functions slightly in one 
direction or the other, introducing some error in the ¢; locations of 
equation (17). It would be extremely difficult to estimate analytically 
the error resulting in the value of P, . Instead an empirical check was 
made over several channel characteristics. Figure 10 shows how error 
rate converges with increasing N, for a typical example of channel 
distortion. Convergence was typically most rapid for k = 0.6. For 
high values of SNR uniform quantization (k = 0.5) was noticeably 
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Fig. 8—Error rate for equal error samples. 
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Fig. 9—Error rate for equal error samples. 


inferior. Because computation time grows rapidly with N,, values of 
N, = 40 and k = 0.6 were used in the examples of Section IV. With 
these values the algorithm is acceptably rapid, even on time-sharing 
facilities. 

A much faster and simpler scheme was proposed by Saltzberg.* 
This technique found an upper bound for the error probability, but it 
was not clear from the theory how tight a bound was being computed. 
Figures 11 and 12 compare the error rate using the method given 
above with the upper bound found using this technique. For binary 
transmission the bound is between one and three orders of magnitude 
above the true value. For 8-level transmission the bound is less tight. 
The discontinuities in the bound versus signal-to-noise ratio occur 
due to the optimum partitioning of the error samples into two sets in 
the upper-bound method. 


(t) Choice of the Block Size in the Method: 


Given M error samples (the M “significant”? samples from the 
original set of N,), it was desired to find the binary pdf of z in 
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equation (15). To do this, the M samples were partitioned into 
M/K blocks of K samples each. Consequently one must compute 
M/K pdf’s based on sets of K samples, and perform M/K — 1 
convolutions of these pdf’s to obtain the binary pdf of z. The 
speed of the convolution process depends on 2N, + 1, the number 
of quantization levels. By direct examination of the algorithm 
one finds: (2) that to form a pdf trying all combinations of K 
samples requires 2“ multiplications; (i7) that each convolution 
requires 5(2N, + 1)? multiplications. The total number of 
multiplications is therefore 


Naut. = 2*M/K = (M/K — 1)5(2N, oa 1)’. (30) 


For example, if N, = 40 and M = 50, a search shows that Ninuis. 
has a minimum with respect to K of 1.2 X 10° at K = 12. The 
saving in computation effort is dramatic: if no decomposition 
were used (K = M), Nwuit. would be 1.1 X 10’°. More generally, 
examination of equation (30) reveals that if M/K > 1 then 
Nuit. Varies linearly with M. Therefore Niuiz, has a minimum 
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Fig. 10—Convergence of error-rate values. 
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Fig. 11—Comparison with upper bound. 


with respect to K that depends only on N, . This minimum is 
K = 12 for N, = 40, and K = 13 for N, = 60, although Nauit. 
is rather insensitive to K for Ke (8, 15). 


VI. CONCLUSIONS 


A method has been developed that accurately computes the prob- 
ability of error for a multilevel digital signal contaminated by intersym- 
bol interference and noise. The method constructs a close approximation 
to the probability density function for the intersymbol interference, 
which can provide useful insight into the nature of the interference. 
The method is rapid enough for use on time-sharing facilities, and can 
provide a useful tool to aid the analysis and design of digital communi- 
cation systems. 

Starting with the transfer function or impulse response of the 
channel, a set of error samples is found, and the method operates on 
these to form the probability density function of intersymbol inter- 
ference. Partial response coding of the source is easily included if 
desired. The formidable size of the computation that has heretofore 
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blocked direct calculation of this probability density function is cir- 
cumvented by partitioning the problem into a sequence of easily 
handled computations. Quantization keeps the total number of ele- 
ments in the probability density function within reasonable limits, 
and the density function for binary transmission is used repeatedly 
to form the density function for 4-, 8-, 16-, --- level transmission. 
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APPENDIX 


Extension to Other Coding Types 


The method described above considers only two coding types: the 
uncoded and partial response IV cases. With slight adjustments in the 
method one can also treat other interesting cases. The class of coders 
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Fig. 12—Comparison with upper bound. 
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considered is that of Kretzmer®*: each symbol c; is related to the 
input symbols {a;} by 


Cy = boa, + byay-1 + a i a a (31) 


where the elements b; of the vector b are integers. (The uncoded case 
is then b, = 1, all other b; = 0). Kretzmer tabulates five partial response 
classes which have proven most interesting: (I) b = (1, 1); (II) b = 
(—1, 0, 2, 0, —1). Other simple types exist as well, such as b = 
(1, 0, 0, 0, 0, 0, 0, 0, 0, —1), a generalization of class IV partial response 
that has five “‘lobes’’ in the signal spectrum.” 

To see how codes of the type in equation (31) could ae included in 
the method, note that there are only three places where the coding 
scheme used makes a difference: (2) in forming the coded error samples, 
as in equation (7), (7) in finding P, from the pdf, as in equation (8), 
and (777) in evaluating signal-to-noise ratio, as in equation (25). 


(i) The coded error samples are easily found for any vector b. 
Using equation (30) in equation (4), and manipulating 


Pe YS bathynte +0 


m=0 


v 
aS Dm >= A; Lj4m + 2 
m=0 7 
» aye; +7, 

2 


x 
ll 


ll 


where 


= Dy DnLj +m (32) 
m=0 


are the coded error samples. . 

(it) Once the pdf of isi has been found from the coded error samples, 
the error probability follows as in equation (8). To use signal 
power efficiently a coder will be chosen so that the transmitted 
voltage levels are equally spaced. Thus one need only evaluate 
the probability of an outer level in order to adapt equation (8) 
to this larger class of coders. An outer level of c, occurs only 
if all relevant a, have their maximum size and correct polarity. 
For N-level a, symbols then, if M of the coefficients b; in equa- 
tion (31) are nonzero, each outer level has probability N~™. 
Thus M replaces h in equation (8). 
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(iii) Finally, since the symbols a, are statistically independent and 
have mean square value E[a’] as in equation (25), it is a simple 
matter to show that the symbols c, have mean square value 

Life 7) = hi Ela’ ], where 


= > b,,. (32) 
m=0 


Therefore h, replaces h, in equation (25) in the evaluation of 
signal-to-noise ratio. 
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